BULLETIN 


OF THE 


AMERICAN 
MATHEMATICAL SOCIETY 


EDITED BY 


EARLE R. HEDRICK 
DAVID R. CURTISS WILLIAM R. LONGLEY 


WITH THE ASSISTANCE OF 


HAROLD T. DAVIS HARRY W. KUHN 

THEOPHIL H. HILDEBRANDT DERRICK N. LEHMER 

JOHN R. KLINE HENRY L. RIETZ 
CAROLINE E. SEELY 


VOLUME XXXVIII, NUMBER 4
APRIL, 1932


PUBLISHED BY THE SOCIETY 
MENASHA, WIS., AND NEW YORK 
1932 


PUBLISHED MONTHLY


Whole No. 391 $9.00 a Year 


Entered as second class matter April 24, 1926, at the post office at
Menasha, Wis., under the act of August 24, 1912. 


AMERICAN MATHEMATICAL SOCIETY 


Mail Address: 501 West 116th Street, New York, N.Y., for the 
Secretary, Treasurer, Librarian, and Clerk. 


Office: Room 413, Physics Building, Columbia University, New York 
City. 
President: Dean L. P. Eisenhart, Princeton University, Prince- 
ton, N.J. 


Secretary: Dean R. G. D. Richardson, Brown University, Provi- 
dence, R.I. 


Associate Secretaries: Professor Tomlinson Fort, Lehigh Uni- 
versity, Bethlehem, Pa.; Professor M. H. Ingraham, Univer- 
sity of Wisconsin, Madison, Wis.; Dean T. M. Putnam, 
University of California, Berkeley, Calif. 


Treasurer: Professor G. W. Mullins, Barnard College in Colum- 
bia University, New York City. 


Librarian: Professor R. C. Archibald, Brown University, Provi-
dence, R.I.


Clerk: Dr. Caroline E. Seely, 413 Physics Building, Columbia 
University, New York City. 


Editor-in-Chief of the Bulletin: Professor E. R. Hedrick, Uni- 
versity of California at Los Angeles, Los Angeles, Calif. 


Managing Editor of the Transactions: Professor R. D. Car-
michael, University of Illinois, Urbana, Ill.


Chairman, Colloquium Editorial Committee: Professor R. L. 
Moore, University of Texas, Austin, Tex. 


Managing Editor of the American Journal of Mathematics: Pro- 
fessor Abraham Cohen, Johns Hopkins University, Balti- 
more, Md. 


Representatives of the Society in the Division of Physical Sci-
ences of the National Research Council: Professors James
Pierpont, D. R. Curtiss, and E. R. Hedrick.


Representatives on the Council of the American Association for 
the Advancement of Science: Professors Louis Ingold and 
M. H. Ingraham. 


Society Visiting Lecturer for 1931-32: Professor R. L. Moore, 
University of Texas, Austin, Tex. 


Colloquium Speaker for 1932: Professor J. F. Ritt, Columbia 
University, New York City. 


Chairman, Committee on Arrangements for the Semi-Centennial: 
Professor T. S. Fiske, Columbia University, New York City. 




THE CALCULUS OF VARIATIONS AND THE 
QUANTUM THEORY* 


BY G. A. BLISS 


1. Introduction. Several years ago an insistent curiosity caused 
me to forsake temporarily fields of mathematical interest more 
legitimate for me in order to find out if possible the character 
of the new quantum theories in which my physicist neigh- 
bors and some of my mathematical friends seemed to find so 
much of scientific interest and excitement. I was entirely ignor- 
ant of the theory at that time and was greatly surprised to find 
that my own specialty, the calculus of variations, through the 
media of mechanical and optical conceptions, had played an 
important role in its development. The quantum theory was 
then in an exceedingly fluent stage, and though the unanswered 
mathematical questions which presented themselves were nu- 
merous and significant it seemed to me useless for mathema- 
ticians to spend large amounts of time in endeavoring to perfect 
mathematical details when some new paper might at short no- 
tice cause fundamental changes in the whole structure of the 
theory. Every one who has followed the development of the 
quantum theory would agree, I think, that this impression was 
justified. At the present time the situation seems somewhat dif- 
ferent. I have heard assurances from various reliable sources 
that the leaders in the promotion of quantum mechanics, men 
of wide physical experience and of daring in mathematical ex- 
ploration distinctly beyond that of the average pure mathe- 
matician who takes his rules so seriously, have reached some- 
thing like agreement. These statements should perhaps not be 
accepted too readily, as they sound like others that have been 
made in the past and which have afterward needed modifica- 
tion. But to the seeker after knowledge, less experienced in this 
field like myself, it also seems that an equilibrium has approximately
been reached in the fundamentals of the quantum
theory, and that pure mathematicians could therefore reasonably
endeavor to contribute in a valuable way to the development
and perfection of its mathematical structure.

* Retiring Address of the Vice-President of Section A of the American As-
sociation for the Advancement of Science, delivered at New Orleans, December
29, 1931, at a joint meeting of this Society, the American Physical Society, and
Section A of the A.A.A.S.

In the following pages I do not intend to attempt to point 
out mathematical defects in the quantum theory, though it 
seems clear that such defects exist. They appear for the most 
part in the generalities and less often in the special problems 
which have so far been successfully studied. My endeavor here 
will rather be to present some of the mathematical aspects of 
the theory to those of my mathematical colleagues who have 
not yet delved in the field. They will agree with me, I think, 
that there is no other applied mathematical domain whose de- 
velopment has involved a wider variety of mathematical tech- 
niques. 

This paper was written originally as a retiring address for a 
chairman of the mathematicians in Section A of the Association. 
I was somewhat perturbed when I found that it had been allotted 
to a joint meeting of mathematicians and physicists. To those 
of the latter class who are here present I can only express my 
sympathy. It will be impossible for me to enlighten them with 
regard to quantum mechanics, and my mathematics may seem 
to them too technical. My only comfort is that it may possibly 
not seem more so than that of their own colleagues who special- 
ize in the quantum theory. 


2. Normal Congruences and Optical Fields. One of the simplest 
problems of the calculus of variations is that of finding in the 
class of arcs 


$$y = y(x), \qquad z = z(x), \qquad (x_1 \le x \le x_2),$$


joining two fixed points in xyz-space, one which minimizes the 
length integral 


$$I = \int_{x_1}^{x_2} (1 + y'^2 + z'^2)^{1/2}\,dx.$$


This is a relatively simple problem concerning whose solution 
we know much intuitively. The possible minimizing curves are 
straight lines 


$$y = ax + b, \qquad z = cx + d, \tag{1}$$

the totality of which depends upon four constants a, b, c, d.
A congruence of lines is a two-parameter family selected out of 
this four-parameter family by setting a, b, c, d equal to functions 
of two other parameters u, v. Geometers have long known that 
the lines of such a congruence will not in general have a surface 
which they all cut at right angles. If, however, there is one sur- 
face cutting all the lines of the congruence orthogonally, then 
there is a whole family of such surfaces given by an equation of 
the form 


$$S(x, y, z) = \text{constant}. \tag{2}$$


The congruence is in that case called a normal congruence. Such 
congruences of lines have many interesting properties. For ex- 
ample the segments of lines of the congruence intercepted by 
two of the surfaces (2) are all equal in length, and the congru- 
ence has so-called focal surfaces to which all of the lines of the 
congruence are tangent. But the property which interests us 
most here is that a normal congruence may be determined by 
its family (2) of orthogonal surfaces as well as by its lines. Not 
every family (2) has a congruence of straight lines cutting it at
right angles. The characteristic property of those families of
surfaces which do belong to a normal congruence is that for
such families the function S(x, y, z) can always be selected so
that it satisfies the differential equation

$$S_x^2 + S_y^2 + S_z^2 = 1,$$

where $S_x$, $S_y$, $S_z$ are the partial derivatives of S with respect
to x, y, and z.

All this will seem rather familiar to my physicist hearers be- 
cause the principal business of the theory of geometrical optics 
is with normal congruences. The straight lines are the light rays, 
the orthogonal surfaces are the wave fronts, and the focal sur- 
faces are the caustic surfaces of optical theory. An interesting 
property of an optical field of rays for the physicist is of course 
that it retains its character after a reflection or a refraction. Sir 
William Hamilton devoted one of his early published papers* 
to the study of the properties of such optical fields. It was the
more than 300-page effort of a youthful writer, and probably
unnecessarily long, but it contains the source of the ideas which
later influenced profoundly the development of the calculus of
variations and the theory of analytical mechanics.

* Theory of systems of rays, Transactions of the Royal Irish Academy, vol.
15 (1824), pp. 69-173; vol. 16, Part I, pp. 3-61, Part II, pp. 93-125; vol. 17,
pp. 1-144.


3. Generalizations in the Calculus of Variations. It has been 
found that properties analogous to those of normal congruences 
are possessed by the minimizing curves of far more general 
problems of the calculus of variations than that of determining 
the shortest distance. When we try to minimize an integral 


$$I = \int_{x_1}^{x_2} f(x, y_1, y_2, y_1', y_2')\,dx$$


in the class of arcs 

$$y_1 = y_1(x), \qquad y_2 = y_2(x), \qquad (x_1 \le x \le x_2),$$

joining two fixed points in $xy_1y_2$-space we find that the only possible
minimizing curves are the solutions of the differential
equations

$$\frac{d}{dx} f_{y_k'} = f_{y_k}, \qquad (k = 1, 2). \tag{3}$$


I use the notations $x, y_1, y_2$ instead of x, y, z because then all of
our equations can be readily interpreted for the (r+1)-dimensional
case, instead of the 3-dimensional case, by simply changing
the range k = 1, 2 to k = 1, ..., r. Meanwhile, for the space
of points $(x, y_1, y_2)$ we can use the language of ordinary
3-dimensional geometry. The differential equations (3) are of the
second order and their solutions, which are called extremals in 
the calculus of variations, form a family of curves whose equa- 
tions, like those of the straight lines (1), contain four arbitrary 
constants. 

It has been found, following Hamilton and probably earlier 
writers also, that the equations (3) take a very pleasing form 
when new variables $x, y_k, z_k$ are introduced in place of the
original variables $x, y_k, y_k'$ by means of the equations

$$z_k = f_{y_k'}(x, y, y'), \qquad (k = 1, 2).$$

Let us define a Hamiltonian function H by means of the equation

$$H(x, y, z) = y_k' f_{y_k'} - f,$$


it being understood here as elsewhere in this paper that a term 
containing the same subscript k twice stands for the sum of the 
terms of that form for the various special values in the range of 
k, as in tensor analysis. It can be proved that the equations of 
the extremals now take the form 


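$$\frac{dy_k}{dx} = H_{z_k}, \qquad \frac{dz_k}{dx} = -H_{y_k}, \qquad (k = 1, 2). \tag{4}$$

The variables $x, y_k, z_k$ are called canonical variables, and the
equations (4) are the canonical equations of the extremals.
Equations (4) may also be written in the form

$$\frac{dy_k}{dx} = (y_k, H), \qquad \frac{dz_k}{dx} = (z_k, H), \qquad (k = 1, 2), \tag{5}$$

where the brackets on the right are so-called Poisson brackets
defined for two arbitrary functions f and g of $x, y_k, z_k$ by the
equation

$$(f, g) = \frac{\partial f}{\partial y_k}\frac{\partial g}{\partial z_k} - \frac{\partial f}{\partial z_k}\frac{\partial g}{\partial y_k}. \tag{6}$$
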
For the fields of geometrical optics the equation which is 
satisfied when the direction dx:dy:dz in xyz-space is at right 
angles with a direction 1:y’:z’ is the well known orthogonality 
condition 


$$dx + y'\,dy + z'\,dz = 0.$$


In the calculus of variations it is found that this equation must 
be replaced by the equation 


$$(f - y_k' f_{y_k'})\,dx + f_{y_k'}\,dy_k = 0.$$


A direction $dx:dy_1:dy_2$ is transversal to the direction $1:y_1':y_2'$
at the point (x, y) if it satisfies this equation; and a surface is
transversal to $1:y_1':y_2'$ at (x, y) if all the directions $dx:dy_1:dy_2$
tangent to the surface at (x, y) satisfy the equation.

We are now in possession of data sufficient to justify a gen- 
eralization of the notion of a normal congruence of straight 
lines. Instead of a two-parameter family of straight lines we con- 
sider a two-parameter family of the extremal curves which 
satisfy the differential equations (3) or (4). Such a congruence
of curves is called a Mayer family, after the name of its dis-
coverer, if there is a single surface which cuts every member of 
the family transversally. It is provable, then, that there is a 
single infinity of such surfaces, the so-called transversal sur- 
faces of the Mayer family, defined by an equation of the form 


$$S(x, y_1, y_2) = \text{constant},$$


and analogous to (2). Not every such set of surfaces is the set of 
transversal surfaces of a Mayer family. In order that this 
property may be possessed by the surfaces it is necessary and 
sufficient that they should be definable by an equation S(x, y) = 
constant for which S satisfies the partial differential equation 


$$S_x + H(x, y, S_y) = 0.$$


This is the famous Hamilton-Jacobi partial differential equa- 
tion of the calculus of variations. It has many interesting rela- 
tionships with the extremals besides the one which has just 
been described. 
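As an illustrative check of the statements above, offered only as a sketch, one may take the length integral of §2 in the present notation, with integrand $f = (1 + y_1'^2 + y_2'^2)^{1/2}$. The Python fragment below (all function names are hypothetical and chosen for this illustration) computes $z_k = f_{y_k'}$ and $H = y_k' f_{y_k'} - f$, compares H with the closed form $-(1 - z_1^2 - z_2^2)^{1/2}$, and verifies numerically that the distance S from the origin satisfies $S_x + H(x, y, S_y) = 0$ in the half-space x > 0, where the transversal surfaces are spheres and the extremals are the straight lines through the origin.

```python
import math

def f(yp1, yp2):
    """Integrand of the length integral: f = sqrt(1 + y1'^2 + y2'^2)."""
    return math.sqrt(1.0 + yp1 * yp1 + yp2 * yp2)

def canonical_z(yp1, yp2):
    """Canonical variables z_k = f_{y_k'} = y_k' / f."""
    fv = f(yp1, yp2)
    return yp1 / fv, yp2 / fv

def hamiltonian_from_slopes(yp1, yp2):
    """H = y_k' f_{y_k'} - f, evaluated directly from the slopes."""
    z1, z2 = canonical_z(yp1, yp2)
    return yp1 * z1 + yp2 * z2 - f(yp1, yp2)

def hamiltonian(z1, z2):
    """Closed form in canonical variables: H = -sqrt(1 - z1^2 - z2^2)."""
    return -math.sqrt(1.0 - z1 * z1 - z2 * z2)

def S(x, y1, y2):
    """Distance from the origin; its level surfaces are spheres."""
    return math.sqrt(x * x + y1 * y1 + y2 * y2)

def grad_S(x, y1, y2, h=1e-6):
    """Central-difference approximations to (S_x, S_y1, S_y2)."""
    return (
        (S(x + h, y1, y2) - S(x - h, y1, y2)) / (2.0 * h),
        (S(x, y1 + h, y2) - S(x, y1 - h, y2)) / (2.0 * h),
        (S(x, y1, y2 + h) - S(x, y1, y2 - h)) / (2.0 * h),
    )

def hamilton_jacobi_residual(x, y1, y2):
    """Left member S_x + H(x, y, S_y) of the Hamilton-Jacobi equation."""
    sx, s1, s2 = grad_S(x, y1, y2)
    return sx + hamiltonian(s1, s2)

if __name__ == "__main__":
    print(hamiltonian_from_slopes(0.7, -1.2), hamiltonian(*canonical_z(0.7, -1.2)))
    print(hamilton_jacobi_residual(1.0, 0.5, -0.3))
```

For this special integrand the Hamilton-Jacobi equation is equivalent, for $S_x > 0$, to the equation $S_x^2 + S_y^2 + S_z^2 = 1$ of §2, which is why the same function S serves in both places.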

We need one further notion from the calculus of variations in 
order to complete the list of those upon which the development 
of the quantum theory has depended. The variables $x, y_k, z_k$
introduced above are not the only ones for which the differential 
equations of the extremals have the canonical form. Trans- 
formations 


$$x = x(X, Y, Z), \qquad y_k = y_k(X, Y, Z), \qquad z_k = z_k(X, Y, Z), \tag{7}$$

which take $x, y_k, z_k$ into new variables $X, Y_k, Z_k$ and which
transform the canonical differential equations into others of the
same form are called canonical transformations. Such transformations
can be constructed in many different ways but the
most important ones for the quantum theory are those which
replace $(x, y_k, z_k)$ by new variables $(X, Y_k, Z_k) = (x, w_k, J_k)$ by
means of equations of the form

$$X = x, \qquad z_k = S_{y_k}(x, y, J), \qquad w_k = S_{J_k}(x, y, J), \tag{8}$$

where S(x, y, J) is a solution of the differential equation

$$S_x + H(x, y, S_y) = E(J),$$

involving the variables $J_k$ as parameters. The last set of equations
(8) must be solved for the $y_k$ and substituted in the second
in order to secure the transformation in the form (7). The important
property of the transformation is that in terms of the
variables $x, w_k, J_k$ the differential equations of the extremals
take the very simple form

$$\frac{dw_k}{dx} = E_{J_k}(J), \qquad \frac{dJ_k}{dx} = 0,$$

and the equations of the extremals are

$$w_k = E_{J_k}(J)\,x + \alpha_k, \qquad J_k = \beta_k, \tag{9}$$

where the $\alpha_k$'s and $\beta_k$'s are constants.

The results from the calculus of variations mentioned in the 
preceding paragraphs have been described for the 3-dimensional
space of points $(x, y_1, y_2)$. But the equations which have
been written are valid for the (r+1)-dimensional space of points
$(x, y_1, \ldots, y_r)$ if we change the range of the indices k from 1, 2
to 1, ..., r. The number of parameters in the equations of the
family of extremals is of course 2r instead of 4, and the Mayer
families for (r+1)-dimensional space contain r parameters instead
of only 2.


4. Applications of the Calculus of Variations. The calculus of 
variations appears in the quantum theory through its applica- 
tions in mechanics. The equations of motion of a particle of 
mass m moving in a field of force with the potential function 
V(x, y, z) are the equations

$$\frac{d(mx')}{dt} = -V_x, \qquad \frac{d(my')}{dt} = -V_y, \qquad \frac{d(mz')}{dt} = -V_z, \tag{10}$$

where the primes indicate derivatives with respect to t. According
to the well known principle of Hamilton they are the
differential equations of the extremals of the integral

$$I = \int_{t_1}^{t_2} (T - V)\,dt, \tag{11}$$

where $T = m(x'^2 + y'^2 + z'^2)/2$ is the kinetic energy of the moving
particle. Thus all of the results of the calculus of variations
described in the preceding section, with suitable changes in
notation, find their applications in mechanics. It is not necessary
to review them in detail for this special case. It should be
mentioned, however, that a similar formulation of Hamilton’s 
principle holds for n particles instead of one, and for any
auxiliary variables $q_k$ in terms of which the coordinates of the
particles may be expressed. The variables $t, q_k, q_k'$ replace the
$x, y_k, y_k'$ of the preceding section. The kinetic energy T is a
homogeneous quadratic form in the derivatives $q_k'$ with coefficients
functions of the $q_k$, and the canonical variables are
$t, q_k, p_k$, where the momenta $p_k$ are defined by the equations
$p_k = T_{q_k'}$. The Hamiltonian function H(t, q, p) turns out to be
the total energy H = T + V of the system and is equal to a constant
E for the motion of every system.

This is a familiar basis for the theory of Newtonian analytical 
mechanics, but the modification of equations (10) which arises 
in the theory of relativity should also be mentioned. In the non- 
relativistic case the mass m is a constant, but in the relativity 
theory it varies in accordance with the law $m = m_0/(1 - v^2/c^2)^{1/2}$,
where $m_0$ is the so-called rest-mass, v is the velocity of the particle,
and c the velocity of light. The equations (10) are no longer
the differential equations of the extremals of the integral (11)
when T has the value described above. They will retain this
property, however, if we replace T by the function

$$T_1 = m_0 c^2\,[1 - (1 - v^2/c^2)^{1/2}],$$

as can readily be verified. The Hamiltonian function for this
relativity case turns out to have the value $H = T_2 + V$, where the
kinetic energy $T_2$ is now given by the formula

$$T_2 = m_0 c^2\,[(1 - v^2/c^2)^{-1/2} - 1].$$
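The verification mentioned above can be sketched numerically. The fragment below is only an illustration under stated assumptions: motion along a single coordinate, units in which $m_0 = c = 1$, and hypothetical function names. It forms the momentum $p = \partial T_1/\partial v$ by a central difference and checks that $pv - T_1$, the velocity-dependent part of the Hamiltonian $pv - (T_1 - V)$, agrees with $T_2$, and that p agrees with mv for the variable mass $m = m_0/(1 - v^2/c^2)^{1/2}$.

```python
import math

def T1(v):
    """Function replacing T in the integral (11); assumed units m_0 = c = 1."""
    return 1.0 - math.sqrt(1.0 - v * v)

def T2(v):
    """Relativistic kinetic energy in the same units."""
    return 1.0 / math.sqrt(1.0 - v * v) - 1.0

def momentum(v, h=1e-6):
    """p = dT1/dv by a central difference (V does not depend on v)."""
    return (T1(v + h) - T1(v - h)) / (2.0 * h)

def hamiltonian_kinetic_part(v):
    """p*v - T1, so that H = p*v - (T1 - V) = (p*v - T1) + V."""
    return momentum(v) * v - T1(v)

def relativistic_mass(v):
    """m = m_0 / sqrt(1 - v^2/c^2) with m_0 = c = 1."""
    return 1.0 / math.sqrt(1.0 - v * v)

if __name__ == "__main__":
    for v in (0.1, 0.6, 0.9):
        print(v, hamiltonian_kinetic_part(v), T2(v))
```

For small v both $T_1$ and $T_2$ reduce to the Newtonian value $v^2/2$ in these units, which is consistent with the non-relativistic case described earlier.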


There are two further principles, important for the purposes 
of this paper, which are phrased in terms of concepts of the 
calculus of variations. The principle of least action in mechanics 
says that every trajectory of a particle of mass m, moving with 
total energy E in a field of force with the potential function V, 
is an extremal of the integral 


$$I = \int_{\tau_1}^{\tau_2} [2m(E - V)(x'^2 + y'^2 + z'^2)]^{1/2}\,d\tau = \int_{s_1}^{s_2} [2m(E - V)]^{1/2}\,ds, \tag{12}$$

where $\tau$ is a parameter in terms of which the coordinates x, y, z
of the trajectory are to be expressed, the primes now denoting
derivatives with respect to $\tau$, and s is the length of arc.
The time t at which the particle arrives at the place defined on
the trajectory by a value of $\tau$ is expressible by the equation

$$t = \int_{\tau_1}^{\tau} [m(x'^2 + y'^2 + z'^2)/(2(E - V))]^{1/2}\,d\tau.$$


Fermat’s principle in geometrical optics has a form much like 
that of this mechanical principle of least action. It says that 
rays of light in a medium in which the velocity of light at a 
point (x, y, z) is u(x, y, z) are extremals of the integral

$$I = \int_{s_1}^{s_2} \frac{C}{u}\,ds, \tag{13}$$

where C is an arbitrary constant whose value does not affect
the determination of the extremals in any way. The two integrals
(12) and (13) are identical if

$$u = C/[2m(E - V)]^{1/2}. \tag{14}$$


Hence the dynamical trajectories of a particle moving with 
total energy E are the same as the paths of light in a medium 
where the velocity of light at each point is given by the equa- 
tion (14). 

John Bernoulli used an analogy similar to this in 1697 when 
he determined the curve down which a particle would fall under
the action of gravity from one fixed point to another in the 
shortest time. He found that the time of descent was given, ex- 
cept for a constant factor, by the integral 


$$\int_{s_1}^{s_2} \frac{ds}{z^{1/2}}.$$

Hence the curve of quickest descent must be a light-ray in a
medium where $u = Cz^{1/2}$. By dividing the medium into horizontal
layers and applying the law of refraction he was able to find
the differential equation of the minimizing curve and to show 
that the curve of quickest descent is an inverted cycloid. The 
paper in which he found this result is often designated as the 
origin of the calculus of variations. 
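The minimizing property of the cycloid can be illustrated by a small computation, given here only as a sketch. It assumes that z is the depth below the starting point, that the particle starts from rest, and that g = 9.81 in metric units; the function names and the choice of end-point are hypothetical. The time of descent is the integral of $ds/(2gz)^{1/2}$, evaluated by the midpoint rule along a cycloid and along the straight line joining the same two points.

```python
import math

G = 9.81  # assumed gravitational acceleration, m/s^2
A = 1.0   # assumed radius of the generating circle of the cycloid, m

def descent_time(dx, dz, z, p0, p1, steps=2000):
    """Midpoint-rule value of the integral of ds / sqrt(2*G*z) along a curve
    given parametrically; dx and dz are the derivatives of the coordinates
    with respect to the parameter, z is the depth below the start."""
    h = (p1 - p0) / steps
    total = 0.0
    for i in range(steps):
        p = p0 + (i + 0.5) * h
        ds = math.hypot(dx(p), dz(p))
        total += ds / math.sqrt(2.0 * G * z(p)) * h
    return total

# Cycloid x = A(p - sin p), z = A(1 - cos p), from p = 0 to p = pi.
t_cycloid = descent_time(
    lambda p: A * (1.0 - math.cos(p)),
    lambda p: A * math.sin(p),
    lambda p: A * (1.0 - math.cos(p)),
    0.0, math.pi,
)

# Straight line to the same end-point (pi*A, 2*A), parametrized as
# x = X1*u^2, z = Z1*u^2 so that the integrand stays bounded at the start.
X1, Z1 = math.pi * A, 2.0 * A
t_line = descent_time(
    lambda u: 2.0 * X1 * u,
    lambda u: 2.0 * Z1 * u,
    lambda u: Z1 * u * u,
    0.0, 1.0,
)

if __name__ == "__main__":
    print("cycloid:", t_cycloid, "line:", t_line)
```

Along the cycloid the integrand is constant, so the computed time agrees with the closed form $\pi (A/g)^{1/2}$; the straight line between the same points requires a longer time.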




5. The Quantum Theory of Bohr. In my effort to give my fellow 
mathematicians some idea of the role which the calculus of 
variations has played in the development of the quantum theory 
I do not hesitate to begin with the early theory of Bohr, even 
though it has been superseded in recent years by quite different 
mathematical mechanisms. It still affords a convenient picture 
by means of which the physicist correlates a considerable part 
of the data in which the quantum theory had its source, and 
with the theory of Schrödinger it forms the chapters of the
development of the quantum theory in which the calculus of 
variations had its greatest influence. Furthermore I believe that 
a knowledge of these earlier stages is essential to an understand- 
ing of quantum mechanics in its more recent forms. 

To the mathematicians I may explain that one of the first 
problems of the quantum theory, and one of the least compli- 
cated illustrations of it, is the theory of the spectrum of hydro- 
gen. On the one hand we have the spectrum, itself, as shown in 
the figure which is drawn without reference to scale. The lines 
fall into four series which have been observed experimentally, 
and others not indicated, each of which has lines converging to 
a dotted limiting line. Each line is due to light of a particular 
wave length $\lambda$, and when the figure is drawn to scale the lines
are spaced proportionally to their wave numbers $\nu = 1/\lambda$.

[Figure: the hydrogen spectrum, not drawn to scale, spanning the infra-red, visible, and ultra-violet regions, with the series labeled Brackett, Ritz-Paschen, Balmer, and Lyman.]

It has been found empirically that these wave numbers are given with
great accuracy by the famous Balmer formula

$$\nu = R(1/m^2 - 1/n^2),$$

in which R is a constant and m and n are integers. The Lyman
series is the sequence of lines corresponding to m = 1 and
arbitrary positive integral values for n, the Balmer series to
m = 2, and so on. Associated with this picture was the physicist's
conception of a hydrogen atom as a dynamical system con-
sisting of a heavy positively charged nucleus with a much 
lighter electron carrying an equal negative charge revolving 
around it. What could be the connection between the two? 
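The Balmer formula is easily evaluated. The following sketch assumes a modern approximate value of the constant R for hydrogen, about 1.0967758 x 10^7 per meter, which is not given in the text; the function names are hypothetical, and n is taken greater than m so that the wave number is positive.

```python
R_H = 1.0967758e7  # assumed value of R for hydrogen, in 1/m (not from the text)

def wave_number(m, n):
    """Wave number nu = R(1/m^2 - 1/n^2) in 1/m, for integers 0 < m < n."""
    if not (0 < m < n):
        raise ValueError("expected integers with 0 < m < n")
    return R_H * (1.0 / m**2 - 1.0 / n**2)

def wavelength_nm(m, n):
    """Wave length lambda = 1/nu, converted to nanometers."""
    return 1.0e9 / wave_number(m, n)

def series(m, count=4):
    """First few wave lengths (nm) of the series belonging to m."""
    return [wavelength_nm(m, n) for n in range(m + 1, m + 1 + count)]

def series_limit_nm(m):
    """Limiting wave length (nm) toward which the series for m converges."""
    return 1.0e9 / (R_H / m**2)

if __name__ == "__main__":
    print("Lyman  (m=1):", series(1), "limit", series_limit_nm(1))
    print("Balmer (m=2):", series(2), "limit", series_limit_nm(2))
```

With this value of R the first Balmer line falls near 656.5 nm, in the visible red, and the first Lyman line near 121.6 nm, in the ultra-violet, in agreement with the arrangement of the series in the figure.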

To answer this question, Bohr assumed that the electron 
is not free to move in all of the orbits which would be permitted 
by ordinary dynamical theory, but only in any one of a de- 
numerable infinity of orbits, the so-called stationary states of 
motion, selected in a way which will presently be described. 
Radiation was supposed to be emitted only when the electron 
fell from one of these orbits to another in which it had a smaller 
total energy, and the wave number of the radiation emitted 
was proportional to the difference $E_n - E_m$ between the total
energies of the atomic system in the initial and final states. This 
somewhat crude dynamical picture corresponds to the fact that 
hydrogen gas does not emit radiation except when suitably ex- 
cited, that is, except when the electrons in its atoms have been 
knocked by a bombardment of some sort into stationary states 
with larger total energies from which they can fall to states 
with smaller energy-values. Similar assumptions were made for 
much more complicated atoms with many electrons revolving 
around their nuclei. 

The calculus of variations was of assistance in the specifica- 
tion of the stationary states and the determination of their 
orbits. It provided canonical transformations from the variables 
$t, q_k, p_k$ of the atomic system to new canonical variables $t, w_k, J_k$
which had among other properties those described in
equations (8) and the following equations of §3 above. For an
atomic system these equations do not involve t and have the
somewhat simpler form

$$p_k = S_{q_k}(q, J), \qquad w_k = S_{J_k}(q, J), \tag{15}$$

$$H(q, S_q) = E(J). \tag{16}$$


The equations of the trajectories in the new variables are iden- 
tical with equations (9) which show among other things that 
the J’s are really constants of integration. In accordance with 
assumptions which had proved successful in the simplest cases 
the quantum orbits were determined by values of the J’s of the 
special form $J_k = n_k h$, where $n_k$ is an integer and h is Planck's
constant, and the frequency $\nu = c/\lambda$ of the emitted radiation
when the system fell from a stationary state of total energy $E_n$
to one of total energy $E_m$ was to be given by the formula
$\nu h = E_n - E_m$. This last equation, when the extremals for two stationary
states of the hydrogen atom were determined and the
total energies $E_m$, $E_n$ expressed in terms of the integers m, n belonging
to the two states, gave the Balmer formula. It was the
first great success of the theory. 

But the agreement between theory and experimental results 
was even more striking than this. When hydrogen gas is excited 
in a constant electric force field each spectrum line is replaced 
by a symmetrically situated group of neighboring lines, and a 
similar effect with a different type of group is produced by a 
constant magnetic field. By modifying the potential function V 
of the atomic system to take account of these electric or magnetic
forces, formulas for the frequencies $\nu$, agreeing well with
those observed on the spectrum, were found by the process de- 
scribed in the preceding paragraphs. It was still more interest- 
ing perhaps that the fine structure of the lines of the spectrum, 
detectable by more delicate measurement, was accounted for by 
replacing Newtonian mechanics by the mechanics of the relativity
theory, as described in §4 above. These were indeed impressive
agreements between observation and theory.

For more complicated atoms the mathematics became of 
course much more difficult and was theoretically somewhat dis- 
turbing. Besides the properties listed above the canonical trans- 
formation (15) was further required to satisfy conditions of the 
form

$$\oint p_k\,dq_k = J_k \qquad (k \text{ not summed}), \tag{17}$$

where the integrals were taken throughout a period of the motion
of the atomic system. These could be satisfied by transforming
the J's to other similar constants when the integral
S(q, J) of equation (16) was a sum of functions each containing
only one of the variables $q_k$, and progress had been made in the
study of more general cases. But I think it must have been a
relief to many when at this stage the quantum theory was given
an entirely new form by the advent of the methods of Heisenberg
and Schrödinger.




Before we take up the consideration of these later developments
we should note that for canonical variables $w_k$, $J_k$ a consequence
of the equations (17) was that each coordinate q of a
particle in the periodic atomic system was expressible as a function

$$q = q(w, J) = q(w, nh),$$

which was periodic with period unity in each variable $w_k$. On
account of this periodicity q could be expanded, for a quantum
orbit having $J_k = n_k h$, in a Fourier series of the form

$$q = \sum_{\tau} c(n, \tau)\,e^{2\pi i \tau_k w_k},$$

where the sum is taken for all sets $\tau = (\tau_1, \ldots, \tau_r)$ of positive
and negative integers. This is an ordinary multiple Fourier series
with the sines and cosines replaced by their values in terms of
exponentials, and it is readily provable that $c(n, -\tau)$ and $c(n, \tau)$
are conjugate imaginaries. The corresponding equation for q
in terms of the time is found by substituting the values (9) for
the w's and has the form

$$q = \sum_{\tau} Q(n, \tau)\,e^{2\pi i N(n, \tau) t}, \tag{18}$$

where

$$Q(n, \tau) = c(n, \tau)\,e^{2\pi i \tau_k \alpha_k}, \qquad N(n, \tau) = \tau_k E_{J_k}(nh).$$
For a fall from a state defined by integers $n_k$ to one defined by
$m_k$'s the frequency of the emitted radiation is

$$\nu(m, n) = [E(nh) - E(mh)]/h = (n_k - m_k)\int_0^1 E_{J_k}[mh + \theta(n - m)h]\,d\theta,$$

by Taylor's formula with integral form of remainder term. This
is a mean value between the frequencies $N(n, \tau)$ and $N(m, \tau)$
of the two terms having $\tau_k = n_k - m_k$ in the series (18) for the
two orbits. The correspondence principle of Bohr resulted in the
assumption that the radiation emitted would be like that of a
set of simple oscillators whose q-coordinate has the mathematical
representation

$$q(m, n)\,e^{2\pi i \nu(m, n) t}, \tag{19}$$

where q(m, n) is the corresponding mean value

$$q(m, n) = \int_0^1 Q[m + \theta(n - m), \tau]\,d\theta, \qquad (\tau_k = n_k - m_k),$$

between the coefficients $Q(n, \tau)$ and $Q(m, \tau)$. By means of this
further rather doubtfully justifiable hypothetical specification 
of the character of the radiation, theoretical polarizations and 
intensities of spectrum lines were calculated whose agreements 
with observed data were close enough to be surprising. This 
situation is, however, a physical one of rather refined delicacy 
for a mathematician to discuss, and I will emphasize further 
therefore only the important result that corresponding to every 
coordinate q of the atomic dynamical system the theory gave a
matrix of elements of the form (19) for which elements situated 
symmetrically with respect to the principal diagonal are con- 
jugate imaginaries. Such a matrix is called by the mathemati- 
cians a Hermitian matrix. 
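The mean-value character of the quantum frequency can be illustrated for a system with a single variable J. The sketch below assumes the hydrogen-like energy law $E(J) = -K/J^2$ with units chosen so that K = h = 1; this choice, like the function names, is an assumption made only for the illustration. It compares $[E(nh) - E(mh)]/h$ with the integral of the classical frequency $N = \tau E_J$ taken between the two orbits, with $\tau = n - m$.

```python
def energy(J, K=1.0):
    """Assumed hydrogen-like energy E(J) = -K / J^2."""
    return -K / J**2

def denergy(J, K=1.0):
    """Derivative E_J(J) = 2K / J^3."""
    return 2.0 * K / J**3

def classical_frequency(n, tau, h=1.0):
    """N(n, tau) = tau * E_J(n h), the frequency of one Fourier term."""
    return tau * denergy(n * h)

def quantum_frequency(m, n, h=1.0):
    """nu(m, n) = [E(n h) - E(m h)] / h for a fall from state n to state m."""
    return (energy(n * h) - energy(m * h)) / h

def mean_classical_frequency(m, n, h=1.0, steps=10000):
    """Midpoint-rule value of the integral over theta in [0, 1] of
    N[m + theta (n - m), tau] with tau = n - m."""
    tau = n - m
    d = 1.0 / steps
    total = 0.0
    for i in range(steps):
        theta = (i + 0.5) * d
        total += classical_frequency(m + theta * (n - m), tau, h) * d
    return total

if __name__ == "__main__":
    m, n = 2, 3
    print(quantum_frequency(m, n), mean_classical_frequency(m, n))
    print(classical_frequency(n, n - m), classical_frequency(m, n - m))
```

For m = 2 and n = 3 the quantum frequency is 5/36 in these units, and it lies between the classical values 2/27 and 1/4 belonging to the two orbits.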


6. The Theory of Schrödinger. The quantum theory as pro-
posed by Schrödinger is based upon a very interesting extension
of the analogy of Bernoulli between optical paths and mechan- 
ical trajectories described near the end of §4 above. His argu- 
ment can be phrased as follows. It has been seen that the ex- 
tremals for the integral (13) in the optical field are the same as 
the mechanical trajectories defined by the action integral (12) 
when the light-velocity u is given by equation (14). In an opti- 
cal field, however, the theory of geometrical optics is not suffi- 
cient to explain minute optical phenomena, such as interfer- 
ence and diffraction. To have a satisfactory theory of these 
latter phenomena, it has been found necessary to associate with 
every field of light rays solutions of the wave equation 


$$w_{xx} + w_{yy} + w_{zz} = w_{tt}/u^2. \tag{20}$$


It seems reasonable, therefore, in the effort to explain minute 
atomic mechanical phenomena, to associate with mechanical 
trajectories solutions of the wave equation with the value of u 
defined by (14). 

The constant C now assumes a more important role. In order 
to determine its value, Schrödinger made two assumptions. The
first was that the frequency $\nu$ of the wave to be associated with
the moving particle should be related to the total energy E of
the particle by the equation $E = h\nu$, as de Broglie had earlier
suggested; the second was that the group velocity of the one-
parameter family of waves associated with the particle for dif- 
ferent values of E should be equal for each E to the velocity v 
of the particle itself. It would take too long to explain here the 
mathematically somewhat vaguely formulated definition of 
group velocity. For mathematicians who may be interested the 
best discussion which I know of is that of Fraenkel.† The result
of the two assumptions is that C = E, so that the final value for
$u^2$ in the wave equation is

$$u^2 = E^2/[2m(E - V)].$$

If we make the substitution

$$w = \psi(x, y, z)\,e^{2\pi i E t/h}, \tag{21}$$

in equation (20), in order to determine the wave functions
which have frequency $\nu = E/h$, we find the new equation for $\psi$

$$\psi_{xx} + \psi_{yy} + \psi_{zz} + 8\pi^2 m(E - V)\psi/h^2 = 0. \tag{22}$$

This is the fundamental equation upon which Schrödinger based
his theory of the mechanics of an atom.
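The step from the wave equation (20) to the equation for $\psi$ may be written out as follows. This is a sketch which assumes that the substitution has the form $w = \psi(x, y, z)\,e^{2\pi i E t/h}$ and uses the value $u^2 = E^2/[2m(E - V)]$; the sign chosen in the exponent does not affect the result.

```latex
\begin{align*}
w_{tt} &= -\frac{4\pi^2 E^2}{h^2}\,\psi\,e^{2\pi i E t/h}, \\
\frac{w_{tt}}{u^2} &= -\frac{4\pi^2 E^2}{h^2}\cdot\frac{2m(E - V)}{E^2}\,\psi\,e^{2\pi i E t/h}
  = -\frac{8\pi^2 m (E - V)}{h^2}\,\psi\,e^{2\pi i E t/h}, \\
w_{xx} + w_{yy} + w_{zz} &= (\psi_{xx} + \psi_{yy} + \psi_{zz})\,e^{2\pi i E t/h}.
\end{align*}
```

Equating the last two members in accordance with the wave equation and removing the exponential factor gives $\psi_{xx} + \psi_{yy} + \psi_{zz} + 8\pi^2 m(E - V)\psi/h^2 = 0$, in which the constant C = E has disappeared.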

With each state of motion of a dynamical system having a 
potential function V(x, y, z) there is to be associated a real or 
imaginary solution $\psi(x, y, z)$ of the wave equation (22) which 
has continuous first and second partial derivatives everywhere 
in xyz-space, which vanishes at infinity, and which furthermore 
has first partial derivatives $\psi_q$ with respect to the coordinates 
$q = x, y, z$ such that the products $r^2\psi_q$ are bounded, where 
$r^2 = x^2 + y^2 + z^2$, as is customarily assumed in the potential theory. 
The determination of solutions $\psi$ with these properties is what 
the mathematicians call a boundary value problem, the parameter 
in equation (22) which they would usually denote by $\lambda$ 
being E. For the mechanical system of the hydrogen atom, for 
example, it turns out that the only negative values of E for 
which the problem has solutions form a denumerable infinity 
$E_1, E_2, \cdots$, and these are precisely the energies of the Bohr 
stationary states. Thus the Schrödinger theory singles out 
automatically the possible energy states for the electron, which 
in the Bohr theory had to be specified by the special assumptions 
$J_i = n_i h$. 
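How a boundary condition singles out a discrete energy can be illustrated numerically. The sketch below is not the method of the paper: it assumes the spherically symmetric case of the hydrogen problem, written in atomic units (so that the radial equation for $u = r\psi$ becomes $u'' = -2(E + 1/r)u$ and the lowest level is expected near $-1/2$), and it uses Numerov integration with bisection, a "shooting" technique chosen here only for illustration. All function names are hypothetical.

```python
def shoot(E, h=0.01, r_max=30.0):
    """Integrate u'' = -2 (E + 1/r) u outward from the origin (Numerov's
    method, atomic units assumed) and return the value u(r_max)."""
    n_steps = int(round(r_max / h))

    def k(r):
        # The radial equation is written as u'' = -k(r) u.
        return 2.0 * (E + 1.0 / r)

    # Series start u ~ r - r^2 for the solution that vanishes at r = 0.
    u_prev, u_curr = h - h * h, 2.0 * h - 4.0 * h * h
    c = h * h / 12.0
    for i in range(2, n_steps):
        r_prev, r_curr, r_next = (i - 1) * h, i * h, (i + 1) * h
        u_next = (2.0 * u_curr * (1.0 - 5.0 * c * k(r_curr))
                  - u_prev * (1.0 + c * k(r_prev))) / (1.0 + c * k(r_next))
        u_prev, u_curr = u_curr, u_next
    return u_curr


def ground_state_energy(lo=-0.6, hi=-0.4, tol=1e-8):
    """Bisect on E until u(r_max) changes sign, i.e. until the solution
    can be made to vanish at large r (the boundary condition)."""
    f_lo = shoot(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = shoot(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    print("lowest characteristic number (atomic units):", ground_state_energy())
```

For every trial E other than a characteristic number the integrated solution runs off to large positive or negative values, which is the numerical counterpart of the statement that only the values $E_1, E_2, \cdots$ admit admissible solutions.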


† Einführung in die Wellenmechanik, p. 24. 




To every value $E_n$ there corresponds a finite number of solutions 
$\psi_n(x, y, z)$ of the wave equation (22), and these solutions can 
be normed and orthogonalized so that 

$$\int \psi_m^* \psi_n \, dx\,dy\,dz = \delta_{mn},$$

where $\psi_m^*$ is the conjugate imaginary of $\psi_m$, where $\delta_{mn}$ is 1 
for m = n and 0 otherwise, and the integrals here 
and in the following equations are taken over the whole of space. 
In accord with analogies in electro-magnetic theory it was 
further assumed that the radiation emitted when the system fell 
from a state of higher energy $E_m$ to one of lower energy $E_n$ 
would be that of a simple oscillator with coordinates $q = x, y, z$ 
of the form 

(23)    $$\int q\, w_m^* w_n \, dx\,dy\,dz,$$

where $w_m^*$ is the conjugate imaginary of $w_m$. The expression (23) 
has the value 

(24)    $$q(m, n)\, e^{2\pi i (E_m - E_n) t/h},$$

with 

$$q(m, n) = \int q\, \psi_m^* \psi_n \, dx\,dy\,dz.$$


The assumptions of the theory therefore lead automatically to 
a matrix of elements (24) of the same form as that which arose 
from the Bohr theory. 

Mathematically, the crucial part of the Schrödinger theory is 
the determination of the characteristic numbers $E_n$ of the 
boundary value problem associated with the wave equation 
(22) and their corresponding characteristic functions $\psi_n$. The 
methods used are those of separation of variables. Theore- 
tically there is an interesting contact at this point also with the 
calculus of variations. For the boundary value problem is what 
the mathematicians call self-adjoint, and every such problem is 
closely associated with a problem of the calculus of variations. 
For the simple 3-dimensional case of the Schrödinger theory 
discussed above this latter problem is that of finding in the class 
of functions $\psi$ satisfying the condition 

$$\int \psi^2 \, dx\,dy\,dz = 1$$

one which minimizes the integral 

$$\int \left[\psi_x^2 + \psi_y^2 + \psi_z^2 + 8\pi^2 m V \psi^2/h^2\right] dx\,dy\,dz.$$

This is an isoperimetric problem of familiar character whose 
extremals have the differential equation (22). The relationship 
between boundary problems and such problems of the calculus 
of variations has had an elaborate development. It was studied 
for the special case of minimal surfaces by Schwarz, and for 
more general cases by Hilbert and many others. 
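The connection with the wave equation can be sketched by the usual multiplier rule. Assume, for illustration, that the integral to be minimized has the integrand $\psi_x^2 + \psi_y^2 + \psi_z^2 + 8\pi^2 m V\psi^2/h^2$, that the side condition is $\int \psi^2\,dx\,dy\,dz = 1$, and write $\lambda$ for the constant multiplier (the symbol $\lambda$ and this normalization of the integrand are assumptions, not taken from the paper). The Euler equation of the combined integrand F then reads:

```latex
\begin{align*}
F &= \psi_x^{2}+\psi_y^{2}+\psi_z^{2}
     +\frac{8\pi^{2}mV}{h^{2}}\,\psi^{2}-\lambda\,\psi^{2}, \\
0 &= \frac{\partial F}{\partial \psi}
     -\frac{\partial}{\partial x}\frac{\partial F}{\partial \psi_x}
     -\frac{\partial}{\partial y}\frac{\partial F}{\partial \psi_y}
     -\frac{\partial}{\partial z}\frac{\partial F}{\partial \psi_z}
   = 2\Bigl(\frac{8\pi^{2}mV}{h^{2}}-\lambda\Bigr)\psi
     -2\bigl(\psi_{xx}+\psi_{yy}+\psi_{zz}\bigr), \\
0 &= \psi_{xx}+\psi_{yy}+\psi_{zz}
     +\Bigl(\lambda-\frac{8\pi^{2}mV}{h^{2}}\Bigr)\psi .
\end{align*}
```

With $\lambda = 8\pi^2 m E/h^2$ the last line takes the form of the wave equation with parameter E, so that the multiplier of the isoperimetric problem plays the part of the energy.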

The calculations of the quantum theory of Schrödinger 
applied to the hydrogen atom and other cases gave theoretical 
results which for the most part agreed as well or better with 
observed data than those of the earlier theory of Bohr. 
Sommerfeld’s treatment of the fine structure gave, however, a less 
satisfactory formula, according to his own statement.† The 
method used in all cases was to substitute the potential func- 
tion V of the system in the wave equation, then to determine 
the energy levels and characteristic functions for the boundary 
value problem, and finally to analyze the radiation emitted by 
the oscillators (24). What could not be accomplished directly 
could sometimes be attained by the use of perturbation methods 
similar to those used in other mechanical theories. 

In closing this section I wish to emphasize again the fact 
that in the Schrödinger theory, as in the preceding theory of 
Bohr, the final result was a matrix of elements of the form (24) 
which characterized the emitted radiation. Furthermore, for 
solutions of the form (21), the equation (20) or (22) is equivalent 
to the new equation 

(25)    $$i h w_t/(2\pi) = -h^2 (w_{xx} + w_{yy} + w_{zz})/(8\pi^2 m) + V w,$$

as one readily sees by eliminating E by means of the equation 

$$w_t = -2\pi i E w/h.$$


† Atombau und Spektrallinien, Wellenmechanischer Ergänzungsband, pp. 
132 ff. 




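If each momentum p in the Hamiltonian function 

(26)    $$H(q, p) = (p_x^2 + p_y^2 + p_z^2)/(2m) + V$$

is replaced by the operator $p = -(ih)/(2\pi) \cdot \partial/\partial q$, where q is 
the corresponding coordinate, and if the resulting operator is 
designated by $H(q, -(ih)/(2\pi) \cdot \partial/\partial q)$, then equation (25) is 
expressible in the form 

(27)    $$\frac{ih}{2\pi}\frac{\partial w}{\partial t} = H\!\left(q, -\frac{ih}{2\pi}\frac{\partial}{\partial q}\right) w.$$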

These remarks have significance for the description in the next 
section of the more recent forms of the quantum theory. 


7. Present Forms of the Quantum Theory. The recent forms 
which the quantum theory has taken are not easy to describe in 
brief fashion because they involve matrix algebra and mathe- 
matical theories of probability as well as much of the mathe- 
matics which has been discussed in preceding pages. I should 
like to try nevertheless to give my fellow mathematicians some 
idea of the contacts which still remain with the calculus of 
variations, in spite of the fact that my experience with the newer 
theories is even more restricted than with the older ones. 

The first step toward the newer point of view seems to have 
been taken by Heisenberg. Because of the artificial character 
of the mechanical mechanisms involved in the Bohr theory, 
or perhaps one should say the non-mechanical mechanisms, he 
proposed to cast those aspects of the theory aside completely 
and to begin with the matrices (24) or (19) themselves which 
appear as the prime results of the theory of Schrödinger as 
well as that of Bohr. To the physicist this seemed a very bold 
proposal, perhaps because the notions of matrices and the 
matrix calculus are not a part of his usual mathematical kit. 
But to the mathematician Heisenberg’s proposal would, I think, 
seem a natural one in view of the results of the preceding 
theories. The lines of the spectrum have a double or matrix ar- 
ray of frequencies which can be regarded as caused by a similar 
array of oscillators. What then are the properties of the matrices 
of oscillators which characterize such spectra? It is this ques- 
tion which would have stumped the mathematician, and it is to 
Heisenberg’s unusual combination of physical experience with 
mathematical instincts that an answer has been found which 
seems in many respects to be successful. 

Heisenberg’s mathematical postulates are in part simple 
enough when once they have been formulated. He proposed to 
replace each coordinate q and momentum p of a classical 
mechanical system by a Hermitian matrix of the kind described 
in equation (24), and to replace the energy function H(q, p), 
expressed as a power series in the p’s and q’s, by the matrix 
defined by the same series when the matrices for p and q are 
substituted. If a suitable interpretation could be found in matrix 
theory for a Poisson bracket (6) then the differential equations 
of the system could be expressed in the form 

(28)    $$dq_i/dt = (q_i, H), \qquad dp_i/dt = (p_i, H),$$

analogous to the equations (5). 

Heisenberg determined the interpretation of the Poisson 
bracket for his matrix theory by a limiting process which had 
been used in the so-called correspondence principle of Bohr and 
which I somewhat arbitrarily avoided in the preceding pages. 
The method of Dirac is more accessible to the mathematician. 
Dirac listed the rules of computation for Poisson brackets as 
defined for the classical theory and found an equation pointing 
to the fact that he could make them valid for matrices also by 
assigning to the bracket (f, g) for two matrices f and g the value 
$c(fg - gf)$, where c is a constant. The difference $fg - gf$ is not in 
general zero for matrices. When it is so the matrices are said to 
be permutable. It turns out that the value of c must be $-2\pi i/h$ 
in order to make the results of the theory agree with experiment. 
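That the expression $c(fg - gf)$ obeys the familiar rules of computation for Poisson brackets can be tried out on small matrices. The following sketch is only an illustration: the three 2-by-2 matrices are arbitrary, the helper names are hypothetical, and the constant is taken as c = 1 so that the integer arithmetic is exact (the rules checked hold for any constant c, including the value $-2\pi i/h$).

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def bracket(f, g, c=1):
    """Bracket (f, g) = c (fg - gf); c = 1 is an illustrative choice."""
    return scale(c, add(matmul(f, g), scale(-1, matmul(g, f))))


# Arbitrary sample matrices (assumptions, chosen only for the check).
f = [[1, 2], [3, 4]]
g = [[0, 1], [1, 0]]
k = [[2, 0], [1, 3]]
zero = [[0, 0], [0, 0]]

# f and g are not permutable: fg - gf is not the zero matrix.
not_permutable = bracket(f, g) != zero
# (f, g) = -(g, f)
antisymmetric = bracket(f, g) == scale(-1, bracket(g, f))
# (fg, k) = f (g, k) + (f, k) g, with the order of factors preserved
product_rule = bracket(matmul(f, g), k) == add(
    matmul(f, bracket(g, k)), matmul(bracket(f, k), g))
# ((f, g), k) + ((g, k), f) + ((k, f), g) = 0
jacobi = add(add(bracket(bracket(f, g), k), bracket(bracket(g, k), f)),
             bracket(bracket(k, f), g)) == zero

if __name__ == "__main__":
    print(not_permutable, antisymmetric, product_rule, jacobi)
```

The product rule is the one in which the order of the factors matters, and it is this rule which points to the difference $fg - gf$ as the only candidate for the bracket.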

In classical mechanics the Poisson brackets for the coordinates 
and momenta have the simple values 

$$(q_k, q_l) = 0, \qquad (p_k, p_l) = 0, \qquad (q_k, p_l) = \delta_{kl},$$

where $\delta_{kk} = 1$ and $\delta_{kl} = 0$ for $k \neq l$. Hence by analogy, and on 
account of the definition of the Poisson bracket, the matrices q, p 
of the quantum theory should satisfy the relations 

(29)    $$\begin{aligned} q_k q_l - q_l q_k &= 0, \qquad p_k p_l - p_l p_k = 0, \qquad q_k p_l - p_l q_k = i h \delta_{kl}/(2\pi), \\ i h \dot q_k/(2\pi) &= q_k H - H q_k, \qquad i h \dot p_k/(2\pi) = p_k H - H p_k, \end{aligned}$$

where the dots indicate time derivatives and the last two 
equations are the equations (28) with the values of the brackets 
substituted. For a particular dynamical system the energy matrix 
H(q, p) is the classical energy function of the system with 
matrices q, p in place of the classical coordinates and momenta. 
These are the fundamental equations of the quantum theory in 
matrix form. 
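A concrete pair of matrices satisfying the relation $qp - pq = ih/(2\pi)$ may help. The sketch below uses the well known matrices of the linear oscillator, cut off to N rows and columns, in units for which $h/(2\pi) = 1$ and the mass and frequency are 1. The cutoff N, the units and the variable names are assumptions made for illustration; no finite matrices can satisfy the relation exactly, and the defect shows up in the last diagonal element.

```python
import math

N = 6  # size of the truncated matrices (an illustrative choice)

# Auxiliary matrix a with a[i][i+1] = sqrt(i+1), and its conjugate transpose.
a = [[math.sqrt(j) if j == i + 1 else 0.0 for j in range(N)] for i in range(N)]
a_dag = [[a[j][i] for j in range(N)] for i in range(N)]

# Hermitian coordinate and momentum matrices in units with h/(2 pi) = 1.
q = [[(a[i][j] + a_dag[i][j]) / math.sqrt(2) for j in range(N)]
     for i in range(N)]
p = [[(a[i][j] - a_dag[i][j]) / (1j * math.sqrt(2)) for j in range(N)]
     for i in range(N)]


def matmul(x, y):
    return [[sum(x[i][k] * y[k][j] for k in range(N)) for j in range(N)]
            for i in range(N)]


qp, pq = matmul(q, p), matmul(p, q)
comm = [[qp[i][j] - pq[i][j] for j in range(N)] for i in range(N)]

if __name__ == "__main__":
    # Diagonal of qp - pq: equal to i except in the last place, where the
    # truncation forces the value -i (N - 1) so that the trace is zero.
    print([comm[i][i] for i in range(N)])
```

The infinite matrices of the theory have no last row, and for them the relation holds in every place.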

A canonical transformation is defined by a matrix S whose 
inverse $S^{-1}$ is the matrix formed from S by replacing all its 
elements by their conjugate imaginaries and then turning the 
matrix through 180° on its principal diagonal as axis. In 
mathematical language $S^{-1}$ is the conjugate transpose of S. If such 
a matrix is at hand, then the transformation $P = SpS^{-1}$, 
$Q = SqS^{-1}$ takes every set of matrices p, q satisfying equations 
(29) into a new set P, Q satisfying equations of the same form. 
Such transformations were used by Heisenberg to find solutions 
of equations (29). When, for the case of the hydrogen atom for 
example, a canonical transformation had been found which took 
the energy matrix H into a diagonal matrix E, the elements 
$E(n, n) = E_n$ in the principal diagonal were the energy levels of 
Bohr, and the frequencies of the elements of the matrices q, p 
turned out to be the radiation frequencies 

$$\nu(m, n) = (E_m - E_n)/h.$$
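The reduction of an energy matrix to diagonal form by a matrix S whose inverse is its conjugate transpose can be followed on a 2-by-2 example. The Hermitian matrix H below, the matrix S built from its characteristic vectors, and the choice of units with h = 1 are all assumptions made for illustration; they do not come from any physical system treated in the paper.

```python
import math

s = 1.0 / math.sqrt(2.0)

# A sample Hermitian "energy matrix" (hypothetical values).
H = [[2, 1j], [-1j, 2]]

# Rows of S are the conjugates of the characteristic vectors of H.
S = [[1j * s, s], [-1j * s, s]]


def dagger(m):
    """Conjugate transpose: conjugate every element, then transpose."""
    n = len(m)
    return [[m[j][i].conjugate() for j in range(n)] for i in range(n)]


def matmul(x, y):
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


S_inv = dagger(S)                    # inverse of S, by the defining property
identity = matmul(S, S_inv)          # should be the unit matrix
E = matmul(matmul(S, H), S_inv)      # transformed energy matrix S H S^{-1}

# Radiation frequency belonging to the two levels, in units with h = 1.
nu = (E[1][1] - E[0][0]).real

if __name__ == "__main__":
    print("diagonal elements:", E[0][0], E[1][1], " frequency:", nu)
```

In this example the diagonal elements come out as 1 and 3, the characteristic numbers of H, and their difference gives the single frequency of the two-level scheme.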

This matrix calculus of Heisenberg has been given a different 
form and has been supplemented by a probability interpretation 
which is based upon analogs of well known algebraic properties 
of Hermitian matrices. A vector $\psi = (\psi_1, \cdots, \psi_n)$ of complex 
numbers in n dimensions is said to be normed if $\sum_k \psi_k \psi_k^* = 1$, where 
$\psi_k^*$ is the conjugate imaginary of $\psi_k$. It is orthogonal to a second 
such vector $\varphi$ if $\sum_k \psi_k \varphi_k^* = 0$. An n-dimensional Hermitian matrix 
m transforms every vector $\psi$ into another $m\psi$ of the same type, 
and has a set of n normed and orthogonal vectors $\Psi(m')$, each 
possessing a real number $m'$ such that $m\Psi(m') = m'\Psi(m')$. The 
vectors $\Psi(m')$ are called the characteristic vectors of the matrix, 
and the numbers $m'$ are its characteristic numbers.† An 
arbitrarily selected vector $\psi$ is expressible linearly in terms of the 
characteristic vectors in the form 

$$\psi = \sum \Psi(m')\, c(m'),$$

where the sum is taken for all the characteristic vectors. If $\psi$ 
is normed, then it is readily provable that 

$$\sum_k \psi_k \psi_k^* = \sum c(m')\, c^*(m') = \sum |c(m')|^2 = 1,$$

an equation which suggests the possibility of interpreting the 
real numbers $|c(m')|^2$ as probabilities. These results are stated 
here for n-dimensional vectors and matrices because it is that 
case which is familiar to most of us. In the quantum theory the 
range of the subscript k and of the characteristic values $m'$ may 
be a continuum, or a combination of a discrete and a continuous 
range, or of still more general character. For these other ranges 
the matrix theorems are usually not proved but are assumed to 
hold by analogy. 

† They are analogous to the “eigenfunctions” and “eigenvalues” of Dirac. 
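The statements about characteristic vectors and the numbers $|c(m')|^2$ can be checked on the smallest case. In the sketch below the 2-by-2 Hermitian matrix, its two characteristic vectors and the normed vector psi are all sample values chosen for illustration, and the variable names are hypothetical.

```python
import math

s = 1.0 / math.sqrt(2.0)

# Sample Hermitian matrix with characteristic numbers +1 and -1.
m = [[0, 1], [1, 0]]
char_vectors = {1: [s, s], -1: [s, -s]}   # normed and orthogonal

# A normed vector to be expanded: 0.6^2 + 0.8^2 = 1.
psi = [0.6, 0.8]


def inner(u, v):
    """Sum of conj(u_k) * v_k over the components."""
    return sum(complex(x).conjugate() * y for x, y in zip(u, v))


def apply(matrix, vec):
    return [sum(matrix[i][k] * vec[k] for k in range(len(vec)))
            for i in range(len(vec))]


# Check m Psi(m') = m' Psi(m') for each characteristic vector.
is_characteristic = all(
    all(abs(lhs - mp * rhs) < 1e-12
        for lhs, rhs in zip(apply(m, vec), vec))
    for mp, vec in char_vectors.items())

# Expansion coefficients c(m') and the numbers |c(m')|^2.
c = {mp: inner(vec, psi) for mp, vec in char_vectors.items()}
prob = {mp: abs(val) ** 2 for mp, val in c.items()}

# Rebuild psi from the sum of Psi(m') c(m').
rebuilt = [sum(char_vectors[mp][k] * c[mp] for mp in c) for k in range(2)]

if __name__ == "__main__":
    print(prob, sum(prob.values()), rebuilt)
```

For this sample the two numbers $|c(m')|^2$ are 0.98 and 0.02 (up to rounding), and it is their adding up to unity which invites the reading of them as probabilities.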

In the presentation of the theory as given by Dirac the 
matrices of Heisenberg are replaced by much more general 
linear operators which operate on symbols $\psi$ representing the 
possible states of motion of the dynamical system under 
consideration. Each state is represented not only by its symbol $\psi$ 
but also by $c\psi$, where c is an arbitrary complex number, and by 
a symbol $\psi^*$ to which $\psi$ is related as a vector to its conjugate 
imaginary. All linear combinations of symbols for states, of the 
form $c_1\psi_1 + c_2\psi_2$, also represent states. For any two states $\psi_1$ and 
$\psi_2$ a scalar product $\psi_1^*\psi_2$ is postulated whose value is a complex 
number such that $\psi_2^*\psi_1 = (\psi_1^*\psi_2)^*$. If this number is zero the 
two states are said to be orthogonal. A symbol for a state may 
be normed so that $\psi^*\psi = 1$ by multiplying it by a suitable 
complex constant. One easily sees that the absolute value of this 
constant is uniquely determined but its amplitude or phase is 
arbitrary. 

At each time t there is associated with each coordinate of the 
dynamical system a linear operator q which transforms each 
state $\psi$ into another and which has properties like those of a 
Hermitian matrix. The operator q has a system of normed and 
orthogonalized characteristic states $\Psi(q')$ with characteristic 
numbers $q'$ such that every state $\psi$ is expressible in the form 

$$\psi = \sum \Psi(q')\, c(q')$$

by analogy with the matrix theory. If the symbol $\psi$ is normed 
then one finds readily that 

(30)    $$\psi^*\psi = \sum |c(q')|^2 = 1.$$


The fundamental idea of the probability form of the quantum 
theory is now that when an observation is made at the time t to 
determine the value of a coordinate of the mechanical system in 
the state $\psi$ the result is not necessarily unique. The values which 
may be found for the coordinate are the characteristic numbers 
$q'$ of the operator q, and the probability that a particular value 
$q'$ will be found at the time t for the state $\psi$ is the term $|c(q')|^2$ 
of the sum (30). The classical case would be the one for which 
q has a single characteristic number $q'$ and behaves like a 
Hermitian matrix with all elements in the principal diagonal 
equal to $q'$ and all other elements equal to zero. Such an operator 
has but one characteristic number $q'$, and all states $\psi$ are 
characteristic states, so that a measurement of q always gives 
the value $q'$ as a certainty. 

The momenta of the mechanical system are similarly 
represented by operators p, and the energy by an operator which is a 
function H(q, p) of the operators q and p. The equations which 
characterize these operators for a particular mechanical system 
are the equations (29) of Heisenberg for reasons similar to those 
described above in deducing them. It is provable that if two 
operators p and q satisfy the third of equations (29), then every 
real number $q'$ is a characteristic number for q, and similarly for p. 
Moreover, the set of permutable operators $q_1, \cdots, q_r$ has normed 
and orthogonal characteristic states $\Psi(q') = \Psi(q_1', \cdots, q_r')$ in 
common, one and only one of which corresponds to every set 
of real numbers $(q_1', \cdots, q_r')$, and in terms of which every 
other state is expressible in the form 

$$\psi = \int \Psi(q')\, \psi(q', t)\, dq'.$$

The integral is taken over the whole of the r-dimensional $q'$-space 
and $\psi(q', t)$ is a function of $q_1', \cdots, q_r', t$, which is said to 
represent $\psi$. The state $\psi$ does not change with the time, but its 
representative must do so, since the operators q and their 
characteristic states $\Psi$ are changing. When $\psi$ is normed it follows that 

$$\psi^*\psi = \int \psi(q', t)\, \psi^*(q', t)\, dq' = \int |\psi(q', t)|^2\, dq' = 1.$$

If an observation to determine all of the coordinates q is made 
at the time t upon a system in the state $\psi$ the possible values for 
the coordinates are the characteristic sets $(q_1', \cdots, q_r')$, and the 
probability that a set will be found in the volume element $dq'$ is 
$|\psi(q', t)|^2\, dq'$, where $q' = (q_1', \cdots, q_r')$ is a point in $dq'$. 

It is important and interesting to note that the representatives 
$\psi(q', t)$ of the states which are possible for a given dynamical 
system are precisely the solutions of Schrödinger’s wave equation 
in the form (27). For it is provable from the equations (29) 
that the results of operating with $q_k$, $p_k$, or H upon a state with 
representative $\psi(q', t)$ are, respectively, the states represented 
by 

$$q_k'\, \psi(q', t), \qquad -\frac{ih}{2\pi}\frac{\partial \psi(q', t)}{\partial q_k'}, \qquad \frac{ih}{2\pi}\frac{\partial \psi(q', t)}{\partial t},$$

provided that the phases of the normed characteristic states 
$\Psi(q')$ are suitably chosen. But since H is also expressible in 
terms of the q’s and p’s it follows then readily that 

$$\frac{ih}{2\pi}\frac{\partial \psi(q', t)}{\partial t} = H\!\left(q', -\frac{ih}{2\pi}\frac{\partial}{\partial q'}\right)\psi(q', t),$$

where the operator on the right is formed by replacing in H(q, p) 
each of q and p by its value as an operator. Thus by reasoning 
quite different from that of Schrödinger we are led again to the 
wave equation (27). 

A mathematician should perhaps not venture to discuss the 
physical reasons for things, but I think I owe to my mathe- 
matical hearers an effort to explain why the quantum theory 
has adopted a probability aspect. There are a number of reasons, 
but I shall speak of only one. The Schrödinger theory provided 
functions w(q, t) which belonged to the stationary states of an 
atom but it did not attempt to describe the mechanical character 
of the atomic system itself except in its identification of the 
group velocity of a one-parameter family of waves with what 
would ordinarily be the velocity of a mechanical particle. Such 
a family or packet of waves may have a resultant amplitude 
different from zero only in a limited portion of space whose ex- 
tent may increase or contract as it moves. The coordinates and 
momenta of such a packet taking the place of the mechanical 
particle would be to some extent uncertain. In the general 
theory this uncertainty corresponds to the fact that the momenta 
p are never permutable with the coordinate operators 
q. Operators have characteristic states in common in general 
only when they are permutable, and the only states for which 
the probabilities described above for a particular operator q or 
p are unity, and the results of observations therefore certain, are 
the characteristic states of the operator. It follows, therefore, 
that in general there is no state for which the coordinates and 
momenta are determinable simultaneously with certainty. The 
mean square deviations $\Delta q$, $\Delta p$ of a corresponding coordinate 
and momentum from their mean values can be calculated at 
least in simpler cases, and it has been shown that $\Delta q\, \Delta p \geq h/(2\pi)$, 
which is the inequality expressing Heisenberg’s famous 
uncertainty principle. 
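One of the simpler cases can be computed directly. The sketch below takes a Gaussian packet $\psi(x) = e^{-x^2/(4\sigma^2)}$ in one dimension, in units with $h/(2\pi) = 1$, and evaluates the root-mean-square deviations of coordinate and momentum by numerical summation. The packet, the units, the value of $\sigma$ and the function name are assumptions made for illustration. With root-mean-square deviations the product for this packet comes out as $h/(4\pi)$; the value $h/(2\pi)$ is obtained when each width is measured in Heisenberg's original way, which is $\sqrt{2}$ times the root-mean-square deviation.

```python
import math


def gaussian_deviations(sigma=1.3, hbar=1.0, half_width=12.0, n=24001):
    """Return (delta_q, delta_p) for psi(x) = exp(-x^2 / (4 sigma^2)).

    The mean values of x and of the momentum vanish for this real, even
    packet, so the deviations are sqrt(<x^2>) and hbar * sqrt(<psi'^2>).
    """
    dx = 2.0 * half_width * sigma / (n - 1)
    norm = x2 = dpsi2 = 0.0
    for i in range(n):
        x = -half_width * sigma + i * dx
        psi = math.exp(-x * x / (4.0 * sigma * sigma))
        dpsi = -x / (2.0 * sigma * sigma) * psi   # derivative of psi
        norm += psi * psi * dx
        x2 += x * x * psi * psi * dx
        dpsi2 += dpsi * dpsi * dx
    delta_q = math.sqrt(x2 / norm)
    delta_p = hbar * math.sqrt(dpsi2 / norm)
    return delta_q, delta_p


if __name__ == "__main__":
    dq, dp = gaussian_deviations()
    print("delta_q * delta_p =", dq * dp, "(in units of h / (2 pi))")
```

Changing sigma narrows one deviation and widens the other in the same ratio, so that the product is unaffected.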

This is as far as we shall have time to go in the discussion of 
the various modifications of quantum mechanics. In the founda- 
tions of the operational or matrix form of the theory the influ- 
ence of the calculus of variations remains only in the preserva- 
tion of the notions of canonical equations and transformations, 
and in the boundary value problems connected with the wave 
equation of Schrödinger. But other equally interesting 
mathematical domains have been freely called upon to contribute 
their quotas. An abstract operational calculus, a kind of general 
analysis of operations, with an associated theory of probability, 
seem to be requisites, and the details of the applications of 
these theoretical mechanisms to the special cases so far studied 
have also great mathematical interest. This can be said in spite 
of the fact that at the present time the pure mathematician, in 
his effort to understand the theory, must plough steadily for- 
ward oblivious to a multitude of detailed questions of math- 
ematical accuracy. In view of the success which the theory seems 
to have had it is not too much to hope that in the end all such 
questions may be satisfactorily answered or avoided, and it is 
even possible that the pure mathematician may contribute a 
valuable mite to this happy result. 


THE UNIVERSITY OF CHICAGO 


1932.] STATISTICAL MECHANICS 225 


STATISTICAL MECHANICS AND THE SECOND 
LAW OF THERMODYNAMICS* 


BY P. W. BRIDGMAN 


One thing that has much impressed me in recent conversa- 
tions with physicists, particularly those of the younger genera- 
tion, is the frequency of the conviction that it may be possible 
some day to construct a machine which shall violate the second 
law of thermodynamics on a scale large enough to be com- 
mercially profitable. This constitutes a striking reversal of the 
attitude of the founders of thermodynamics, Kelvin and 
Clausius, who postulated the impossibility of perpetual motion 
of the second kind as a generalization from the uniformly un- 
successful attempts of the entire human race to realize it. 
Paradoxically, one very important factor in bringing about this 
change in attitude is the feeling of better understanding of the 
second law which the present generation enjoys, and which is 
largely due to the universal acceptance of the explanation of the 
second law in statistical terms, for which Gibbs was in so large 
a degree responsible. Statistical mechanics reduces the second 
law from a law of ostensibly absolute validity to a statement 
about high probabilities, leaving open the possibility that once 
in a great while there may be important violations. Doubtless 
another most important factor in present scepticism as to the 
ultimate commercial validity of the second law is the discovery 
of the importance in many physical phenomena of those fluctu- 
ation effects which are demanded by statistical mechanics. It is 
very hard indeed for one who has witnessed the Brownian mo- 
tion for the first time to resist the conviction that an ingeni- 
ous enough engineer might get something useful out of it, and 
I have no doubt that many in this audience have tried their 
own hand at designing such a device, and I also have no doubt 
that their success has been discouragingly nil. 

There are other aspects also of the statistical point of view 
which have become prominent in the last few years, as for ex- 


* The Ninth Josiah Willard Gibbs Lecture, delivered at New Orleans, De- 
cember 29, 1931, under the auspices of this Society, at a joint meeting of this 
Society, the American Physical Society, and Section A of the A.A.A.S. 


226 P. W. BRIDGMAN [April, 


ample, the speculations of Eddington on time’s arrow and on 
the meaning of time in general, the speculations of Lewis as to 
completely reversible time, all the recent concern and new no- 
tions about the destiny of the universe as a whole, in which 
thermodynamic arguments play a most important part, and 
it is of course notorious that the notions of probability, which 
are fundamental in statistics, are at the very bottom of wave 
mechanics. 

It is evident, therefore, that the statistical point of view en- 
tails consequences important both conceptually and practically. 
In the hope of helping a little to a better understanding of the 
situation I propose today to examine a few of the implications 
and consequences of the statistical point of view. The program 
is a very modest one, and I hope you will have no expectation of 
a final solution of any of these difficult questions; my primary 
purpose is to awaken a more vivid self consciousness of what the 
situation actually is. I shall be mostly concerned with the 
classical statistics, and shall have less to say about questions 
raised by wave mechanics; we shall find that the questions 
raised by the classical point of view are sufficiently fundamental. 
I shall throughout adopt the point of view that I have called 
operational, that is, I shall seek the meaning of our state- 
ments and concepts by trying to analyze what it is that we do 
when we are confronted with any concrete physical situation to 
which we attempt to apply the concept or about which we make 
the statement. 

It is in the first place most important to realize that the 
statistical method, in which the notions of probability are 
fundamental, has, when applied to the understanding of physi- 
cal situations, certain inherent, unique, logical characteristics, 
so that any account which statistics can give of physical 
phenomena must have an entirely different aspect from that of 
such a method of approach as classical mechanics, for example. 
The reason for this is that probability is not a notion which can 
be applied to concrete individual events, whereas we demand 
that we understand, or predict, or control, the individual event. 
I demand to know what will happen when I throw this particu- 
lar stone in the air, or explode this particular charge in the 
cylinder of a gas engine. Ordinary mechanics gives an unequivocal 
answer, and in general the explanations of ordinary mechanics 
make direct contact with just such concrete individual 
physical events. But the notions of probability have no such 
application to individual events, and in fact the notion of 
probability is meaningless when applied to an individual event. 
The proof of this is given by mere observation of what we do in 
applying the notion of probability. Suppose that I show you a 
die and remark that I intend to throw it in a minute. You 
volunteer the information that the probability is one-sixth that 
the throw will be a six. I am sceptical and ask you to justify 
your statement by pointing out the property of the event, when 
it takes place, that can be described as a probability of one-sixth 
for a six. I then make the throw and get a six. What possible 
characteristics has this single event that can justify your state- 
ment? Your statement has immediate meaning only when 
applied to a long sequence of events, or when applied to the 
construction of the die and the method of throwing it. Even 
when applied to a sequence of events there is always an un- 
bridgeable logical chasm thwarting a precise application of the 
notion of probability to any actual sequence. Consider, for ex- 
ample, the classical example of tossing a coin. In practice our 
first concern is to determine whether the coin is a fair coin, that 
is, whether it is equally likely that heads or tails appear. Sup- 
pose that we make a million throws, and find the excess of one 
or the other not to be more than 1,000. Then we are likely to 
say that the coin is fair. But, logically, we are bound to recog- 
nize that the coin may have been loaded so that heads were, 
perhaps, nine times as likely to appear as tails, only that we 
had happened on one of those excessively rare sequences in 
which as many tails appeared as heads. Rigorously, there can 
be no method by which we can be sure that all our past experi- 
ence has not been one of these excessively rare sequences which 
logically we are bound to recognize as possible in any statistical 
assembly. 
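The orders of magnitude in the coin example can be made concrete. The sketch below is an illustration under stated assumptions: the fair coin is treated by the normal approximation to the binomial distribution, the loaded coin is taken to show heads with probability 0.9 (the "nine times as likely" of the text), and for the loaded coin only the probability of exactly equal numbers of heads and tails is computed, as representative of the sequences in question. The variable names are hypothetical.

```python
import math

N = 1_000_000  # number of throws

# Fair coin: the excess (heads minus tails) has standard deviation sqrt(N).
sigma_excess = math.sqrt(N)
p_fair_within_1000 = math.erf(1000.0 / (sigma_excess * math.sqrt(2.0)))

# Loaded coin with P(heads) = 0.9: logarithm (base 10) of the probability
# of exactly N/2 heads, from the binomial formula with log-gamma functions.
p_heads = 0.9
half = N // 2
log_binom = math.lgamma(N + 1) - 2.0 * math.lgamma(half + 1)
log10_p_loaded_even = (log_binom
                       + half * (math.log(p_heads) + math.log(1.0 - p_heads))
                       ) / math.log(10.0)

if __name__ == "__main__":
    print("fair coin, excess within 1000:", round(p_fair_within_1000, 4))
    print("loaded coin, equal heads and tails: about 10 **",
          round(log10_p_loaded_even))
```

On these assumptions a fair coin keeps its excess within 1,000 in roughly two trials out of three, while the loaded coin produces an even split with a probability whose exponent runs to several hundred thousand; the number is excessively small, but it is not zero, and that is the whole of the logical difficulty.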

Passing over these logical difficulties in applying precisely 
the notions of probability to any actual physical situation, it is 
evident, I think, that when applied to individual events, prob- 
ability can have only a secondary or derived meaning. I believe 
that an examination, as the operational point of view prescribes, 
of what one does, will show that the meaning to be ascribed to 
the probability of individual happenings is to be found in the 
rules of the mental game that one plays in thinking about and 
trying to understand the individual events. This has important 
consequences when we attempt to use the notions of probability 
in building physical theories. We must recognize in the first 
place that any physical theory demands the construction of 
some sort of model. Now any model involving notions of proba- 
bility is of necessity more remote from the physical situation 
and more esoteric in character than the more usual and naive 
models, such as are offered by ordinary mechanics. For any 
statistical model involves conventionalized events to which the 
notions of probability are by definition applicable, although the 
notion of probability does not apply at all to the concrete 
physical events which are the counterpart of the events of the 
model. It is therefore not surprising that the connection be- 
tween the properties of the statistical model and the correspond- 
ing physical system is somewhat different from the connection 
in the more ordinary sorts of model. 

It would be possible to digress considerably here to discuss 
the properties which we demand in our models, and the uses to 
which we put them. The subject is fairly obvious, however, and 
I believe we can safely assume an understanding of the es- 
sential features. The least exacting demand that we make of a 
model is that it serve as a calculating device, by which we can 
predict actual physical happenings, and for this purpose any 
sort of consistent correspondence between the model and the 
physical happening is satisfactory. The simplest way in which 
the statistical model can satisfy this simplest and minimum de- 
mand is evidently that actual happenings in the physical sys- 
tem shall correspond to high probabilities in the statistical 
model. This furthermore seems to be the only possibility and 
all that we can do; in the model there are no certainties, only 
probabilities, some of which, it is true, may be very close to 
unity, whereas in the physical system there are invariable hap- 
penings, as for example, a cake of ice always melts when it is 
heated above the melting point, or the external atmosphere 
always rushes into an exhausted electric light bulb when it is 
cracked. On the other hand, it is most natural to say that low 
probabilities in the model correspond to infrequent occurrences 
in the physical system. But to go further, and specify exactly 
how close to unity we shall demand that the probability be that 
is to correspond to an invariable happening, is not so easy, and 
there seems to be a certain unavoidable fuzziness here defeating 
every endeavor to make sharp connection between the model 
and actuality. There is a still greater difficulty in giving a pre- 
cise physical significance to events in the model of low proba- 
bility; we shall return to this question. For the present the 
important point for us is that any statistical model is in peculiar 
degree purely a paper and pencil model and peculiar limita- 
tions may be expected in the use of such a model. 

If the only demand that we put on the model were that it 
should serve as a computing device, the situation would be com- 
paratively simple and could be quickly dismissed. But as a 
matter of fact we make the more exacting demand that the 
model enable us to understand the physical situation, and to this 
end we demand that there be a further correspondence between 
the properties of the model and of the physical system. Since 
we do not usually make the extreme demand that the model 
enable us to understand all the physical properties of the system 
at once, we usually do not demand that there be an exhaustive 
correspondence between the properties of the physical system 
and those of the model, but we are satisfied with a correspond- 
ence of those properties only which are pertinent for our im- 
mediate purposes. Thus for the discussion of the thermody- 
namic properties of a perfect gas, a model is usually sufficient in 
which the molecules of the actual gas are replaced by perfectly 
elastic spheres or ellipsoids, although such a model gives no 
hint of the optical properties of the actual gas. It is curious, 
however, that even for thermodynamic purposes we would not 
be satisfied with a model in which the number of fictitious mole- 
cules is not equal to the number of actual molecules which 
various other sorts of physical evidence lead us to ascribe to the 
actual gas. 

If, now, we are attempting to find a model for the thermo- 
dynamic properties of a gas, we see that the accepted models 
which satisfy these additional requirements go far beyond the 
original demands, for in such models we encounter all the 
phenomena of fluctuations. Strictly, such a model never comes 
to equilibrium, and cannot therefore possess any property 
which strictly corresponds to temperature as defined classically 
in terms of equilibrium states. The remarkable fact, of course, 
is that the fluctuations of the model were found to correspond to 
fluctuations in the physical system, as shown by the Brownian 
motion, and in consequence we have now come to recognize that 
temperature is physically only an approximate concept, instead 
of the exact concept originally assumed in thermodynamics. 
This gives us at once one possible method of dealing with the 
second law and its apparent violations. If we choose to formulate 
the second law by the statement that dQ/T is an exact differ- 
ential, then by its very form it applies only to those situations to 
which the temperature concept applies, and since the tempera- 
ture concept never exactly applies to any physical situation, we 
are left with a law which may be rigorously exact in the limit, 
but which applies to no actual situation. This method of meet- 
ing the situation may perhaps be satisfactory to the pure logi- 
cian, but to the individual interested in filling his pockets by 
bootlegging entropy, such considerations will appear as unin- 
teresting and as ineffectively legalistic as the restrictions which 
the more ordinary sort of bootlegger fails to recognize. 

It is probably not possible to set up a mechanistic model of a 
purely thermodynamic system, for the thermodynamic system 
knows no details, but only pressure and volume and tempera- 
ture, whereas it is the essence of a mechanical model that it 
contain details which can have no counterpart in the physical 
system in so far as it is purely thermodynamic. This prepares us 
to recognize that the concepts of thermodynamics have no 
absolute validity, but are relative to the operations, and in 
particular to the scale of the operations which we use. Thus a 
fluid in turbulent motion may have a temperature from the 
point of view of a thermometer with a bulb several centimeters 
in diameter, but may have no temperature from the point of 
view of a minute thermo-couple such as biologists have recently 
used in probing the interior of single cells. Or again, the entire 
body of phenomena to which the so-called third law is applica- 
ble would immediately appear in a different aspect and the 
third law would no longer be applicable if the operation of taking 
atoms apart and recombining them was added to the other per- 
missible operations of a more conventional character. In recent 
papers Dr. David Watson has discussed some of the conse- 
quences of a recognition of some of the other relativistic char- 
acteristics of the entropy concept. 




There seem to be two diametrically opposite and equally 
natural reactions to an appreciation of this situation. The first 
reaction is that of the younger generation, part of which, at 
least, expects to discover some day in the realm of microscopic 
operations the possibility of a profitable entropy bootlegging 
business, and there is, secondly, the reaction of the other school, 
which is convinced that the second law involves something 
absolutely fundamental, and that any formulation in terms in- 
volving relative magnitudes or permissible operations can be 
only an incomplete formulation. 

In the endeavor of the second school to find a more funda- 
mental formulation it is natural to attempt to capitalize the 
striking success that the statistical point of view has already 
had in dealing with the phenomena of fluctuations. One recent 
and well known attempt in this direction is that of Eddington 
to appraise the second law in terms of what is essentially a 
shuffling process. Eddington sees in the universal tendency for 
entropy to increase, or for a system to pass from a less to a more 
probable configuration, the analog of what happens when a 
pack of cards is shuffled. Now although there may be strong 
points of analogy between these two processes, it seems to me 
that there are also essential differences, and that the analogy on 
the whole is not a happy one. Shuffling acquires meaning only 
when we are able to apprehend the cards as individuals by mark- 
ing them so that we can identify them as individuals, but in 
such a way that there shall be absolutely no effect on the shuffl- 
ing process. The picture that Eddington had in mind was a 
pack of cards freshly received from the manufacturer, arranged 
in suits and by rank in the suits, and then shuffled and losing 
all trace of its original arrangement. But as G. N. Lewis has re- 
marked, the arrangement of the cards in suits is one of entirely 
arbitrary significance; from the point of view of some other 
game than whist the initial arrangement was already a completely
shuffled arrangement as well as all the subsequent ones. Or we 
may look at it in another way. Imagine an infinite sequence of 
deals, the cards being partially shuffled between each deal, and 
suppose that a complete record is kept of all the hands. Some- 
where in this sequence there will be deals in which the cards are 
distributed among the hands by suit and the arrangement in 
each hand is by rank. If one examines the record it will be found
that as one proceeds away from the exceptional deal, both for- 
ward and backward in the sequence, all trace of the regular ar- 
rangement becomes obliterated. That is, shuffling with respect 
to any configuration selected arbitrarily as of special signifi- 
cance is symmetrical with respect to past and future time, and 
the analogy with the thermodynamic situation disappears. 
There are other difficulties with the shuffling point of view. We 
have seen that shuffling has meaning only when the cards can 
be identified as individuals. But if the cards can be handled as 
individuals, they can be unshuffled, as any whist player will
demonstrate to you within five seconds of picking up his hand. 
To see in the shuffling process something analogous to the in- 
exorableness of the increase of entropy in nature involves the 
thesis that, although it may be possible to identify the elements 
in a physical situation, there is some restriction in nature which 
prevents us from treating these elements separately and sorting 
them out. This, it seems to me, is a hard doctrine. There is no 
suggestion of such a state of affairs in any picture offered by the 
classical mechanics, and classical mechanics was all that Ed- 
dington had in mind. Until the reason is elucidated for this 
surprising inability of ours to handle what we can see, I believe 
that the shuffling analogy must be judged only to obscure the 
situation rather than to clarify it. 

At first glance, the possibility of understanding the strange 
restriction against touching what we can see is even more re- 
mote from the wave mechanics point of view than from the 
classical, for seeing is now to be considered as a kind of touching, 
namely touching with a photon. It may well be, however, that 
we have here the key to the ultimate solution, for it is not suffi- 
cient merely to touch, but the touch must also control; such a 
kind of touch doubtless requires the cooperation of a great 
many photons, and thus will be less possible of attainment than 
the touch by a single photon which is sufficient for recognition. 
But this point of view we do not follow further here. 

Not only does Eddington see in shuffling the complete analogy 
of the inexorable increase of entropy, but he sees something even 
more fundamental, namely an explanation of the properties of 
time itself, and in particular why it is that time is unsymmetrical 
and flows only forward. This he has expressed by saying that the 
increase of entropy of the universe is what it is that gives direction
to time’s arrow. This conception deals with such fundamental
matters and has been hailed in so many quarters as
being of such unique profundity, that we may be pardoned for 
stopping for an examination of what is involved. Eddington sees 
the crux of the matter in an essential difference between the 
equations of ordinary mechanics, including electrodynamics, 
and thermodynamics. The equations of ordinary mechanics and 
of electrodynamics, which express what Eddington calls pri- 
mary law, are of such a character that the differential of time 
may be reversed in sign with no change in the equations, as, 
for example, in the equation for a falling body, $d^2s/dt^2 = -g$. It
is evident enough that the equation is unaltered if the sign of 
dt is changed; the question is what is the significance of this 
observation? The significance that Eddington ascribes to it is 
that the equation is unaffected by a reversal of the direction of 
flow of time, which would mean that the corresponding physical 
occurrence is the same whether time flows forward or backward, 
and his thesis is that in general there is nothing in ordinary 
mechanical occurrences to indicate whether time is flowing for- 
ward or backward. In thermodynamic systems, on the other 
hand, in which entropy increases with time, time enters the 
differential equation as the first derivative, so that the direc- 
tion of flow of time cannot be changed without changing the 
equation. This is taken to indicate that in a thermodynamic 
system time must flow forward, while it might flow backward in 
a mechanical system. 

As thus expressed there seems to be considerable vagueness 
about some of the ideas; this vagueness I find in Eddington’s 
original formulation. Such vagueness is natural and perhaps in- 
evitable in a popular exposition, but if we are to understand 
from the operational point of view what is involved here, we 
must try to be more precise. Careful analysis has not yielded 
to me more than the following as an exact statement of what is 
involved. Imagine a closed system, and an assistant with a 
note book in which at a given instant of time he notes all the 
data necessary to characterize completely the configuration of 
the system. At a later instant of time he records the correspond- 
ing data in another note book. He then gives us the note books 
and we try from a study of them to determine which set of data 
was recorded at the earlier instant of time. If the system was a
thermodynamic system we can make the decision, because the 
entropy increases with time, but if the system was a mechanical 
system we cannot decide which note book was used first, be- 
cause examples could be set up for either one or the other. Ed- 
dington expresses this difference by saying that the direction of 
flow of time has no significance in the mechanical system. 

It is, of course, true that the differential equation of the me- 
chanical system is differently constructed from that of the 
thermodynamic system, but the significance of the difference 
does not need to be formulated as Eddington formulates it. We must in
the first place remember that the equation of the mechanical 
system, for example, $d^2s/dt^2 = -g$, applies not only to a single
system, that is to a single falling body, but applies as well to a 
family of systems. The equation has the property that, cor- 
responding to every specific system, with its particle at a defi- 
nite point moving with a definite velocity at a definite instant 
of time, another system is possible with its particle in the same 
position at the same instant, but with a negative velocity. This 
is because the equation is of the second order, and gives on 
integration two constants, which may be so adjusted as to give 
any position and any velocity at any instant of time. Imagine 
the second system set up; as time goes on it will trace out in 
reverse sequence the positions of the first system, as may be 
seen from the equation itself, which may be written 


$$\frac{d^2 s}{dt^2} = \frac{dv}{dt} = \frac{d(-v)}{d(-t)} = -g.$$
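
The reversal property just stated can be checked numerically. The following is a minimal sketch and not part of the original argument: the value of g, the initial conditions, and the function names are assumptions chosen only for illustration. It uses the closed-form solution of $d^2s/dt^2 = -g$, records the positions of one system at a set of instants, then starts a second system from the final position with the velocity reversed and confirms that it passes through the first record in reverse order.

```python
import math

G = 9.81  # assumed value of g; any positive constant would serve

def state(s0, v0, t, g=G):
    """Position and velocity at time t for d^2 s/dt^2 = -g, starting from (s0, v0)."""
    return s0 + v0 * t - 0.5 * g * t * t, v0 - g * t

def record_positions(s0, v0, times, g=G):
    """Positions noted at each of the given instants (the 'note book' entries)."""
    return [state(s0, v0, t, g)[0] for t in times]

def retraces_in_reverse(s0, v0, times, g=G):
    """True if the velocity-reversed system retraces the first record backward.

    The instants must be equally spaced and start at zero, so that the
    i-th instant from the start pairs with the i-th instant from the end.
    """
    first = record_positions(s0, v0, times, g)
    s_end, v_end = state(s0, v0, times[-1], g)
    second = record_positions(s_end, -v_end, times, g)
    return all(math.isclose(a, b, abs_tol=1e-9)
               for a, b in zip(second, reversed(first)))

times = [0.0, 0.5, 1.0, 1.5, 2.0]
print(retraces_in_reverse(10.0, 3.0, times))
```

Both records are produced with the time variable increasing; only the initial velocity differs between the two systems.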


It is this fact which makes it impossible to decide, in our ex- 
ample above, which note book was used first, because there is 
no way of telling from the entries alone whether they applied 
to the first or the second system. But in no case is there any 
question of time flowing backward, and in fact the concept of a 
backward flow of time seems absolutely meaningless. For how 
would one go to work in any concrete case to decide whether 
time were flowing forward or backward? If it were found that 
the entropy of the universe were decreasing, would one say 
that time was flowing backward, or would one say that it was a 
law of nature that entropy decreases with time? It seems to me 
that in any operational view of the meaning of natural concepts
the notion of time must be used as a primitive concept, which 
cannot be analyzed, and which can only be accepted, so that it 
is meaningless to speak of a reversal of the direction of time. I 
see no way of formulating the underlying operations without 
assuming as understood the notion of earlier or later in time. 

Lewis, in a recent paper, "The symmetry of time in physics,"
adopts a point of view in some respects similar to that of Ed- 
dington, although Lewis would certainly disclaim more than 
very partial agreement with Eddington. Lewis speculates about 
the fundamental significance of the symmetry of time in phys- 
ics. His point of view takes its origin from the four-dimensional 
representation of events employed in relativity theory, and the 
consequent reduction of all propositions in kinematics to propo- 
sitions in four-dimensional geometry, rather than from the form 
of the equations of mechanics, as in Eddington’s theory. But 
both neglect what I believe to be the most important aspect of 
the situation. Both the equations giving the motion of the 
system and the four-dimensional representation of the motion 
are only a small part of the story. The equations are without 
significance unless the physical operations are also defined by 
which numerical values are assigned to the various symbols of 
the equations. For instance, in treating a falling body, we need 
in addition to the equation itself a set of directions for the use of 
the equation, in which it is set forth, among other things, that 
$s$ is the number obtained by making with a meter stick certain
manipulations connected with an arbitrary origin and the in- 
stantaneous position of the falling body. Similarly in the four- 
dimensional representation, we must know the physical opera- 
tions by which the numerical values of the coordinates are ob- 
tained which go into the four-dimensional diagram. If one ex- 
amines the operations by which meaning is given to the symbols 
which occur in the equations or to the coordinates in the geo- 
metrical representation it will be found that the time concept 
has to be assumed as primitive and unanalyzable, for the opera- 
tions essentially assume that the operator understands the 
meaning of later and earlier in time. For example, in order to 
find the velocity of a particle, one has to observe its position at 
some one instant of time and combine with this in a prescribed 
way the result of another observation at a later instant. If one 
does not intuitively understand what is meant by a later instant,
there is no method of formulating the operations. The
same situation is involved in specifying a thermodynamic sys- 
tem. One of the variables is the temperature; it is not sufficient 
merely to read at a given instant of time an instrument called a 
thermometer, but there are various precautions to be observed 
in the use of a thermometer, the most important of which is 
that one must be sure that the thermometer has come to 
equilibrium with its surroundings and so records the true 
temperature. In order to establish this, one has to observe how 
the readings of the thermometer change as time increases. 

This point of view, that the schedule of operations by which 
the symbols acquire meaning is as important a part of the 
physical situation as the relations which are found to hold be- 
tween the symbols themselves, has an important bearing on a 
very widely spread tendency in modern physics and science in 
general to see nothing as significant except the relations, and so 
to reduce all science to a kind of topology. It is this point of 
view that is at the bottom of Einstein’s philosophy when he 
says, for example, that all that is observed is a series of space- 
time coincidences, and which Eddington expresses by saying 
that nature may be reduced to a series of pointer readings. If 
one grants that the ultimate object of physics is to establish a 
certain sort of relation between us and the world of our senses, 
to speak with a certain monstrous naïveté, I do not see how it is
possible to discard as irrelevant the fact, for example, that the
fourth coordinate in the four-dimensional geometry of relativity 
has to be obtained by an entirely different sort of operation 
from the other three coordinates, or to regard the entire situation 
as exhaustively characterized by the relations between the 
numbers, irrespective of how they are obtained. 

We return now to a further consideration of our statistical 
model and the methods by which we shall establish a cor- 
respondence between its properties and those of the physical 
system. Hitherto we have been gratifyingly successful; events 
in the model of overwhelmingly high probability correspond to 
invariable happenings in the physical system, and less common 
events in the model, such as fluctuation phenomena, are found 
to be prophetic of a previously unsuspected new domain of 
physical effects, typified by the Brownian motion. Encouraged 
by this success, it is natural to think that we have got hold of
something real, whatever that may mean, and to push our 
scheme of correlation to the logical limit, and say that all the 
excessively rare events corresponding to low probabilities in the 
statistical model are correlated with corresponding rare events 
in the physical system. Now it is a consequence of the funda- 
mental assumptions which have gone into the usual statistical 
model, namely that all elementary configurations are entirely 
independent of each other, so that the probability of any con- 
figuration is to be calculated by purely combinatorial methods 
from the relative number of ways in which the configuration 
can be realized, that there is some chance of the occurrence of 
any configuration, no matter how unusual its properties. This 
would mean that in the corresponding physical system any con- 
figuration whatever, compatible with the fixed conditions, would 
occur occasionally, as, for example, the gas in a box will oc- 
casionally automatically all collect itself into one end. This con- 
clusion is indeed taken literally by many experts in statistical 
mechanics, and in the literature statements are not uncommon, 
such, for example, as that of Bertrand Russell in a recent maga- 
zine article that if we put a pail of water on the fire and watch 
it for an indefinite time, we shall eventually be rewarded by 
seeing it freeze. It seems to me that there are a couple of ob- 
jections that can be made to the conventional treatment of 
rare occurrences, which I shall now examine. 

The first difficulty is with the technical method of calculating 
the chances of observing a rare configuration, and is concerned 
only with the model itself, and not with the physical application 
of the results of the calculations. In computing the chance of 
any configuration, it is always assumed that the elements of the 
statistical model are without influence on each other, so that 
the chance of a given configuration is given by merely enumerat- 
ing the number of complexions corresponding to the given con- 
figuration. For example, in the kinetic theory of gases it is as- 
sumed that the location of any molecule and its velocity is, ex- 
cept for the restriction on the total energy and the total volume, 
independent of the location or the velocity of any or all of the 
other molecules. It may be proper enough to postulate this for 
the model, but we know that it cannot rigorously correspond 
to the physical system, for the molecules of a gas do interact 
with each other, as shown by the mere fact that they conserve
their total energy, and the transmission of energy from one 
molecule to another takes place only at a finite rate, so that if, 
for example, at one instant all the velocity were in a single mole- 
cule, we would find that immediately afterward only molecules 
in the immediate vicinity had any velocity. This means that the 
assumption of complete independence must be recognized to 
be only an approximation, and some way of handling this ap- 
proximation must be devised. The method usually adopted is to 
cast the problem in the form of inquiring how many observa- 
tions must be made in order that the chance of observing the 
desired rare configuration may be one half, for example, choos- 
ing the time between observations so long that at each observa- 
tion all appreciable trace of the previous configuration shall 
have been obliterated, so that the assumption of independence 
may apply. The point now is this: the time that one has to wait 
for the probable obliteration of all traces of a previous con- 
figuration becomes longer the rarer the previous configuration; 
obviously it takes longer for a gas to efface all trace of having 
been all concentrated in one half of its available volume than to 
efface the traces of a small local concentration. The situation is, 
therefore, that not only must we make an increasingly large 
number of observations in order to hope to witness a rare con- 
figuration, but the interval between our observations must also 
get longer. It is merely the first factor which is usually con- 
sidered; when both factors are considered it is not at all obvious 
that the process is even convergent. This point should be sub- 
jected to further examination. 
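
The first factor alone can be made concrete with a short calculation. The sketch below is illustrative only: it takes the independence assumption of the model at face value and ignores the lengthening interval between observations entirely. The function names and the molecule counts are assumptions, not figures from the text. Under independence each molecule is equally likely to be in either half of the box, so the chance $p$ that all $N$ lie in one specified half is $2^{-N}$, and the number $m$ of independent observations giving an even chance of at least one success satisfies $1 - (1 - p)^m = 1/2$.

```python
import math

def log10_prob_all_in_one_half(n_molecules):
    """log10 of the chance that every molecule is in one specified half of the box,
    assuming (as the model does) that the molecules are placed independently."""
    return -n_molecules * math.log10(2.0)

def observations_for_even_chance(p):
    """Number m of independent observations for which 1 - (1 - p)**m = 1/2."""
    # log1p keeps precision when p is extremely small
    return math.log(0.5) / math.log1p(-p)

for n in (10, 100, 1000):  # hypothetical molecule counts
    log_p = log10_prob_all_in_one_half(n)
    # for tiny p, m is very nearly ln(2)/p, so it is reported on a log10 scale
    log_m = math.log10(math.log(2.0)) - log_p
    print(f"N={n}: log10(p) = {log_p:.1f}, log10(m) about {log_m:.1f}")
```

Even for a hundred molecules the number of observations required is of the order of $10^{30}$, and this is before any allowance is made for the second factor, the growing interval between observations.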

There is another difficulty connected with the mere calcula- 
tion of the probability of rare occurrences presented by quantum 
theory. All classical calculations assume that the molecules have 
identity. But the uncertainty principle sets a limit to the 
physical meaning of identity. It is not possible to observe the 
position and velocity of any molecule with unlimited precision, 
but there is a mutual restriction. After an observation has been 
made, the domain of uncertainty in which the molecule is 
located expands as time goes on. If the domains of uncertainty 
of two molecules overlap, then the identity of the molecules is 
lost, and a subsequent observation will not be competent to 
decide which molecule is which. The only way of maintaining 
the identity of the molecules is by making observations at
intervals so frequent that the domains of uncertainty have not 
had time to overlap. But this time is obviously much shorter 
than the time between observations demanded by the require- 
ment that all trace of the previous configuration shall have been 
wiped out. Furthermore, the act of observation, by which the 
concept of identity acquires meaning, alters in an uncontrollable 
and unpredictable manner the motion of the molecules, whereas 
the statistical treatment requires that the molecules be un- 
disturbed between successive observations. It seems, therefore, 
that the physical properties of actual molecules as suggested 
by quantum theory are different from those of the molecules of 
the model, and this would seem to demand at least designing 
new methods of calculating the chances of rare occurrences. 

Apart from these objections, which may be met by the discovery
of new theoretical methods of attack, it seems to me
that the most serious difficulty with this question of rare states 
is met in the process of transferring to any actual physical 
system conclusions based on a study of the corresponding model. 
Suppose, for example, that we are discussing the problem of the 
tossing of some particular coin. If the coin is a fair coin, that is, 
if the chances of heads and tails are even, then our statistical 
model consists merely of a sequence of one or the other of two 
events, each of which is as likely to occur at any time as the 
other, absolutely independently of what may have happened 
elsewhere in the sequence. The theoretical discussion of this 
model is very easy, and we are convinced that conclusions 
drawn from a discussion of the model will apply to the tossing 
of the coin, always provided that the coin is a fair coin. As a 
particular problem we may consider the chance of throwing 
heads ten consecutive times. The chance of this is $(1/2)^{10}$, or
1/1024, which means that in every 1,000 consecutive throws the 
chances will be roughly even that there will be somewhere a 
sequence of 10 heads.* But 1,000 throws are a good many, and 
it may be that we have never made so many throws, and are 
content merely to make the prediction that if some one else 
should make so many throws it would be found to be as we say. 
But suppose that some one questions the fairness of the coin, 


* I am much indebted to Mr. H. M. James for a rigorous solution of the 
interesting problem involved here. He finds that between 1422 and 1423 throws 
are necessary for a 0.5 chance of ten or more consecutive heads. 




and says that he has reason to think that there is a bias of 10% 
in favor of throwing tails, so that the chance of a head at a single 
throw is only 0.45 instead of 0.50. We find now on making the 
calculation that we shall have to make roughly 10,000 throws in 
order to have an even chance of getting a sequence of 10 heads; 
and, in general, that slight imperfections in the fairness of the 
coin make very large differences in the chance of rare occur- 
rences. In view of this, we feel that it behooves us to make some 
objective test of the fairness of the coin before we venture to 
publish our prediction that we are likely to get a sequence of 10 
heads in 1,000 throws. We make the most direct test possible by 
appealing to the fundamental definition of fairness, which is 
that in a large number of throws the ratio of the number of 
heads to tails tends to equality. But how many throws are 
necessary to establish such an equality with satisfactory assurance?
There is another theorem here, namely that in $n$ throws
the chances are even that we shall have an excess either of heads
over tails or of tails over heads of $0.6745\,n^{1/2}$. Neglecting the
numerical factor for our rough purposes, this means that if 
we make a hundred throws the chances are nearly even that 
the number of heads is somewhere between 45 and 55. To establish
the fairness of the coin we would have to make a considerable
number of 100 throws at a time and observe whether or not
the number of heads clusters between 45 and 55. If, on the other
hand, there is a 10% bias in favor of tails, the number of heads 
will cluster between 40 and 50. The precise number of sequences 
of 100 throws at a time necessary to convince us that there is no 
10% bias in favor of tails obeys no definite criterion, but it is 
certainly of the order of ten or more, which makes 1,000 or more 
throws altogether. But this was the number of throws necessary 
to obtain one of the rare sequences of 10 heads. 
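
The quantities in this example can be explored with a short calculation. The following is a minimal sketch under the assumptions of the model (independent throws with a fixed chance p of heads); the function names are illustrative and do not come from the text. The first function computes exactly, by following the length of the current run of heads, the chance of at least one run of k heads in n throws; the second searches for the smallest n giving an even chance; the third gives the chance that the number of heads in n throws lies in a stated window. The exact values it produces need not coincide with the round numbers used in the rough argument above.

```python
import math

def prob_run_of_heads(n, k, p):
    """Chance of at least one run of k consecutive heads in n independent throws."""
    run = [1.0] + [0.0] * (k - 1)   # run[j]: chance the current run has length j, no success yet
    done = 0.0                      # chance that a run of k heads has already occurred
    for _ in range(n):
        done += run[k - 1] * p      # one more head completes the run
        run = [sum(run) * (1.0 - p)] + [r * p for r in run[:k - 1]]
    return done

def throws_for_even_chance(k, p, limit=1_000_000):
    """Smallest n for which a run of k heads has a chance of at least 1/2."""
    run = [1.0] + [0.0] * (k - 1)
    done = 0.0
    for n in range(1, limit + 1):
        done += run[k - 1] * p
        run = [sum(run) * (1.0 - p)] + [r * p for r in run[:k - 1]]
        if done >= 0.5:
            return n
    raise ValueError("limit reached before the chance rose to 1/2")

def prob_heads_between(n, p, low, high):
    """Chance that the number of heads in n throws lies in [low, high] inclusive."""
    return sum(math.comb(n, h) * p**h * (1.0 - p)**(n - h)
               for h in range(low, high + 1))

print(prob_run_of_heads(10, 10, 0.5))       # a single block of ten throws
print(throws_for_even_chance(10, 0.50))     # fair coin
print(throws_for_even_chance(10, 0.45))     # coin biased toward tails
# The excess of heads over tails is 2H - n, so the window for H is half as wide.
half_width = 0.6745 * math.sqrt(100) / 2
print(prob_heads_between(100, 0.5,
                         math.ceil(50 - half_width), math.floor(50 + half_width)))
```

For the fair coin the second printed value should land near the figure quoted in the footnote, and comparing it with the value for p = 0.45 shows how strongly a small bias affects the chance of the rare sequence. The last function may be called with any window, for example with p = 0.45, to see how far a single set of 100 throws can distinguish the biased coin from the fair one.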

The conclusion from all this is plain; in order to establish with 
sufficient probability that the actual physical system has those 
properties which are assumed in estimating the frequency of 
rare occurrences it is necessary to make a number of observa- 
tions so great that the probability is good that the rare occur- 
rence has already been observed. In other words, purely logical 
statistical considerations never can justify us in predicting 
events so rare that they have never yet been observed. A pail 
of water has never been observed to freeze on the fire; statistical
considerations give us no warrant whatever for expecting that 
it ever will. Such predictions can be made only on the basis of 
considerations other than statistical. Thus in the case of the 
coin, an exact measurement of its dimensions and of the degree 
of homogeneity of its metal might convince us that the chances 
of heads and tails were even, because of our knowledge of the 
laws of mechanics. But when we come to the molecules of a gas 
or the elements of other physical systems to which the statisti- 
cal method of treatment is usually applied, we see that there is 
no method of independently handling the elements, so that the 
statistical method of testing the validity of our assumptions is 
the only possible one. This is a most natural situation, because 
if we were capable of dealing with the elements of the physical 
system as individuals we could apply more powerful methods 
than the statistical. Incidentally we may remark how very insensitive
the statistical method is in studying elemental properties;
this is shown by the example of the coin above, where we
had to make something of the order of 1,000 throws to establish 
an asymmetry of 10%. In many cases, however, the statistical 
method is doubtless the ultimate and the only method. 

Another of the applications of statistical ideas in which 
there has always been much interest, and especially lately, is to 
the problem of the ultimate fate of the universe. It is a very 
common opinion that the second law, in its original classical 
form, demands the ultimate heat death of the universe, be- 
cause of the inexorable increase of entropy to a final maximum, 
when all temperature differences shall have been wiped out. The 
chief mechanism in the ultimate equalization of temperature is 
obviously the radiation that is continually emitted by the stars. 
The human mind has, however, shown itself curiously unwilling 
to accept the prospect of a heat death, and there have been 
a number of attempts to avoid such an unwelcome conclusion. 
At least two of these, somewhat similar to each other, are based 
on the statistical interpretation of the second law. The first of 
these utilizes the theorem that in a closed mechanical system 
any configuration, once experienced, is bound to recur after the 
lapse of sufficient time. According to this view, the universe 
endlessly goes through cycles of repetition, the so-called Poin- 
caré-Zermelo cycle, of prodigious but calculable duration. The 
obvious objection to this picture is that in order to realize a
Poincaré-Zermelo cycle the laws of classical mechanics would 
have to be satisfied with an exactness quite fantastic, hopelessly 
beyond the possibility of direct or indirect verification. The 
second attempt to make statistics avoid the heat death rests 
on the theorem that a statistical assembly is never in complete 
equilibrium, but is always subject to fluctuations, and these 
fluctuations may attain any intensity if we only wait long 
enough. The present state of the universe is then to be regarded 
only as a fluctuation, with the possibility that similar fluctua- 
tions may recur in the future. The difficulty with this point of 
view is the excessive rareness of the sort of fluctuation corre- 
sponding to the present state of the universe compared with the 
approximate dead level of the heat death. The previous con- 
siderations apply; this is one of those configurations so rare that 
one has no right to predict its occurrence unless it has been pre- 
viously observed. One might predict from purely statistical con- 
siderations the occurrence of such a fluctuation in the future if 
one were sure that one were observing such a fluctuation now. 
But where is the evidence for this? According to the astronomers 
the fluctuation has been taking the last 10", or perhaps now 
10", years or so to smooth itself out to its still considerable 
roughness, and there is certainly no evidence that before 10'* 
years ago the entropy was decreasing instead of increasing. 

There are other objections to an application of the second
law to the entire universe. The original formulation of the sec- 
ond law was, of course, restricted to isolated systems. By what 
logical right can the argument be extended to the entire uni- 
verse? A natural reply is that relativity theory seems to demand 
that the universe is finite, so that the whole universe becomes 
the sort of isolated system demanded by the classical formula- 
tion. But I believe that examination will nevertheless show a 
very important difference between the smaller and the all-embracing
closed system. Statistical mechanics, if it is to avoid the
difficulties already discussed when applied to any individual 
physical situation, must make the assumption of molecular 
chaos. But what in the physical situation gives rise to molecular 
chaos? If we imagine a gas, for example, in a perfectly reflecting 
enclosure, and suppose that the molecules are perfectly elastic 
spheres, then, according to the classical picture, every collision 
takes place under perfectly definite conditions, so that a mathe- 
matician of sufficient power could compute backward from the 
present configuration to the configuration at any past time, as, 
for example, when a partition might have been removed from 
the middle of the compartment. This sort of condition certainly 
cannot be described as molecular chaos. But the walls are molec- 
ular in structure, so that the reflection of the individual gas 
molecules follows no definite rule. If we regard the molecules 
in the wall as part of the external universe, and if there is no 
coordination between the motion of the molecules of the wall 
and what is taking place in the gas because of the enormous 
magnitude of the external universe compared with the gas in- 
side, then a physical reason justifying the assumption of molec- 
ular chaos is at once apparent. When the entire universe is con- 
sidered there can be no such justification as this for assuming 
molecular chaos, but the whole course of events must, from the 
classical point of view, run a rigorously deterministic course, to 
which statistical considerations do not apply. It may be objected 
to this argument that a gas in contact with its walls in the way 
described above is not an isolated system. It is, of course, not 
completely isolated, but it is nevertheless as far as the thermody- 
namic requirements go, which are concerned only with transfers 
of energy and of heat from the outside. Complete isolation 
would seem to be incompatible with molecular chaos. 

It was intimated at the beginning of this discussion that the 
heat death is supposed to be a consequence of the second law, 
and that the continuous enormous radiation into empty space 
of the stars is the most striking manifestation of this tendency. 
This point of view sees in the emission of every photon by a 
star part of the inexorable increase of entropy. I believe, how- 
ever, that this is fallacious, that the relations are different, and 
that the heat death with which we are confronted as a conse- 
quence of continued radiation is an affair of the first law, not of 
the second, and will take place when all the energy of the uni- 
verse has been radiated away, not merely when the energy is 
uniformly distributed. Elementary considerations justify this 
contention. Consider a body radiating into empty space. It is 
continually dropping in temperature and losing in energy. The 
emission of radiation therefore decreases the entropy of the ra- 
diating body. Consider next a body in thermal equilibrium with 
its surroundings; since it is in equilibrium its entropy is constant, 
and furthermore it absorbs as much radiation as it emits. Ab- 
sorption of radiation, therefore, increases the entropy of the 
absorbing body. This is sufficient to give a straightforward ac- 
count of entropy changes in radiation problems. Consider two 
bodies confronting each other in a cavity at constant tempera- 
ture. A photon leaves one of the bodies, decreasing the entropy 
of that body. During the passage of the photon across the space 
separating the two bodies the entropy is to be thought of as as- 
sociated with the photon and residing somewhere in the inter- 
mediate space. When the second body absorbs the radiation, 
the photon with its entropy disappears from empty space and 
increases the entropy of the absorbing body. At all stages of the 
process the entropy of the entire system is unchanged. But now 
suppose that one body radiates to another at lower temperature. 
This process is irreversible and is accompanied by an increase of 
entropy. The first two steps of the process, emission of the pho- 
ton and passage across the intermediate space, are the same as 
before, and are therefore accompanied by no net change of en- 
tropy. It is only the last stage of the process, absorption by the 
body at lower temperature, that can give the uncompensated 
increase of entropy. The mechanism of this increase is to be 
found in a diffusion of the energy of the photon into the greater 
number of degrees of freedom corresponding to the lower tem- 
perature. Actually, this argument is over-simplified, and must 
be modified by a consideration of the distribution of the photons 
through the spectrum, but the details of this point of view need 
not concern us further. The immediate point for us is that emis- 
sion of radiation into empty space is not an entropy changing 
process; the increase of entropy can occur, if it occurs at all, only 
during the act of absorption. 
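
The bookkeeping described above can be sketched numerically. The sketch assumes the simplest Clausius accounting, in which a body at absolute temperature T that gives up heat q loses entropy q/T and a body that takes it in gains q/T; the function name and the sample temperatures are assumptions for illustration. As the text itself notes, this picture is over-simplified, since it ignores the spectral distribution of the photons.

```python
from fractions import Fraction

def radiative_exchange_entropy(q, t_emitter, t_absorber):
    """Return (dS_emitter, dS_absorber, dS_total) for heat q carried by radiation."""
    ds_emit = -Fraction(q) / t_emitter    # emission lowers the emitter's entropy
    ds_abs = Fraction(q) / t_absorber     # absorption raises the absorber's entropy
    return ds_emit, ds_abs, ds_emit + ds_abs

# Two bodies at the same temperature: the total is unchanged.
print(radiative_exchange_entropy(600, 300, 300))
# A hotter body radiating to a colder one: the surplus appears only at absorption.
print(radiative_exchange_entropy(600, 600, 300))
```

In the second case the emission term is the same as it would be in equilibrium at the emitter's temperature; the net increase comes entirely from the larger absorption term at the lower temperature.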

But what physical evidence have we of the absorption of the 
light radiated by the stars? To save the situation we must pos- 
tulate absorption under completely unknown conditions. But is 
the assumption of such unknown conditions in regions so terri- 
bly far beyond access to us in order to save the second law any 
easier as an intellectual feat than the assumption of other un- 
known conditions which would defeat the second law? Are we 
not completely in the dark here, and had we not better admit it? 

Finally, I briefly summarize what I believe to be the principal 
results of this analysis and indicate the possible lines of future 
progress. The most important result will be, I hope, a keen 
realization that in using statistics we are only using a paper and 
pencil model, which has logical difficulties within itself and 
difficulties of application to concrete physical situations which 
are very much greater than the corresponding difficulties with 
more ordinary sorts of model. Some of these difficulties I believe 
can never be surmounted, so that the statistical model can never 
be satisfactorily used by extrapolation either into remote epochs 
of time, to predict rare events, or into remote reaches of space, 
to give us an idea of the course of universal evolution. Our model 
has not given us a satisfactory answer to our initial question 
as to the possibility of commercially profitable violations of the 
second law. The answer to this question will probably be found 
when the wave mechanics point of view has been completely 
worked out. Some of the other logical difficulties of the classical 
statistics, I believe, will also be surmounted by the adoption of 
the wave mechanics point of view, which assumes probability 
to be a primitive property of the elements of the model, rather 
than an emergent property resulting from the cooperation of 
great numbers. Before, however, the wave mechanics thesis of 
the primitive character of the notion of probability can be ac- 
cepted, much more experimental work is necessary. If the thesis 
is supported, as seems probable from the evidence now at hand, 
this will constitute to a certain extent a defeat of the purpose of 
the classical statistics, which was to explain why many physical 
assemblages of large numbers of elements obey the rules of 
probability. But even granted that the primitive character of 
the notions of probability acquires an experimental verification, 
it seems to me that some of the logical difficulties will persist, 
justifying a doubt as to the possibility of ever setting up a 
logically completely satisfying correspondence between our 
models and our experience. 


HARVARD UNIVERSITY, 
RESEARCH LABORATORY OF PHYSICS 




NOTE ON DIOPHANTINE AUTOMORPHISMS* 


BY LEONARD CARLITZ† 


1. Introduction. Professor E. T. Bell‡ defines a diophantine 
automorphism of a form $f = f(x_1, \dots, x_n)$, where f is a homo- 
geneous polynomial, to be a transformation of f into $f^h$, h an 
integer > 1, by means of a substitution $X_i = X_i(x_1, \dots, x_n)$, 
$(i = 1, \dots, n)$, so that 

\[ f(X_1, \dots, X_n) = f^h(x_1, \dots, x_n). \tag{1} \]
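The first non-trivial instance of an algebraic phenomenon of 
this kind is due to Eisenstein,§ who noticed that the discrimi- 
nant of the general binary cubic possesses such an automor- 
phism. Indeed, if we set 

\[ f(x_1, x_2, x_3, x_4) = x_1^2x_4^2 - 3x_2^2x_3^2 + 4x_1x_3^3 + 4x_2^3x_4 - 6x_1x_2x_3x_4, \]

\[
\begin{aligned}
X_1 &= 3x_1x_2x_3 - x_1^2x_4 - 2x_2^3 = -\frac{1}{2}\frac{\partial f}{\partial x_4},\\
X_2 &= 2x_1x_3^2 - x_1x_2x_4 - x_2^2x_3 = \frac{1}{6}\frac{\partial f}{\partial x_3},\\
X_3 &= x_1x_3x_4 - 2x_2^2x_4 + x_2x_3^2 = -\frac{1}{6}\frac{\partial f}{\partial x_2},\\
X_4 &= x_1x_4^2 - 3x_2x_3x_4 + 2x_3^3 = \frac{1}{2}\frac{\partial f}{\partial x_1},
\end{aligned}
\]

it may be verified that $f(X_1, X_2, X_3, X_4) = f^3(x_1, x_2, x_3, x_4)$. This 
follows at once if it be noticed that f(X) is the discriminant of 
the cubicovariant of the basic binary cubic, and further that 
all the invariants of the binary cubic are expressible in terms of 
the discriminant. 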
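
Eisenstein's identity f(X) = f^3(x) can be spot-checked with exact integer arithmetic. The sketch below assumes the usual normalization of the discriminant of the binary cubic (x1, x2, x3, x4) and takes each X_i to be the indicated multiple of a first partial derivative of f; because the scanned formulas are damaged, these explicit polynomials are a reconstruction and should be read as an assumption, not as a transcription.

```python
from itertools import product

def f(x1, x2, x3, x4):
    """Discriminant of the binary cubic with coefficients (x1, x2, x3, x4) (assumed form)."""
    return (x1**2 * x4**2 - 3 * x2**2 * x3**2 + 4 * x1 * x3**3
            + 4 * x2**3 * x4 - 6 * x1 * x2 * x3 * x4)

def substitution(x1, x2, x3, x4):
    """The cubic substitution X_i built from first partial derivatives of f."""
    X1 = 3 * x1 * x2 * x3 - x1**2 * x4 - 2 * x2**3      # -(1/2) df/dx4
    X2 = 2 * x1 * x3**2 - x1 * x2 * x4 - x2**2 * x3     # +(1/6) df/dx3
    X3 = x1 * x3 * x4 - 2 * x2**2 * x4 + x2 * x3**2     # -(1/6) df/dx2
    X4 = x1 * x4**2 - 3 * x2 * x3 * x4 + 2 * x3**3      # +(1/2) df/dx1
    return X1, X2, X3, X4

def identity_holds(point):
    """True when f(X(x)) equals f(x) cubed at the given integer point."""
    return f(*substitution(*point)) == f(*point) ** 3

# Check every integer point with coordinates in {-2, ..., 2}.
print(all(identity_holds(p) for p in product(range(-2, 3), repeat=4)))
```

Since both sides are polynomials of degree 12, agreement on a grid of this size is strong evidence for the identity, though it is not a proof.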

Cayley has given a different proof and at the same time a 
generalization|| of Eisenstein’s result. If we set 


* Presented to the Society, December 30, 1930. 

† National Research Fellow. 

‡ This Bulletin, vol. 33 (1927), pp. 71-80. 

§ Journal für Mathematik, vol. 27 (1844), p. 105. 

|| A. Cayley, Collected Mathematical Papers, vol. 1, 1889, No. 15, Note 
sur deux formules données par MM. Eisenstein et Hesse, pp. 113-116. 




u(x) = + + xf + + + 
— 2x1 — — 
— — — 4X5 Xe, 
1 Ou 


2 Ox; 


then Cayley’s result is $u(X) = u^3(x)$. In another paper,* Cayley 
proves that the Hessian of Eisenstein’s quaternary quartic, 
that is, $f(x_1, x_2, x_3, x_4)$ above, is the product of a numerical con- 
stant into the square of f. However, no connection between this 
property and the automorphic property is indicated. 

Another instance of (1) is furnished by the general symmetric 
determinant of order n: its adjoint is also symmetric and is equal 
to the (n−1)st power of the original determinant. This is of 
course true of the general determinant. However the latter is 
quite trivial since the general determinant, regarded as a func- 
tion of $n^2$ independent variables, is composable, and clearly all 
composable forms have the automorphic property (1). It should 
be noticed that here again the automorphic transformation is 
expressed in terms of partial derivatives of the first order. 
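
The statement about the adjoint can be illustrated on a small case. The sketch below builds the adjoint (adjugate) of an integer matrix from its cofactors, which are the first partial derivatives of the determinant with respect to the elements, and compares its determinant with the (n−1)st power of the original determinant. The helper names and the sample matrix are assumptions chosen for illustration; the helper expects order n ≥ 2.

```python
def det(m):
    """Determinant by Laplace expansion along the first row (fine for small orders)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def adjugate(m):
    """Transpose of the cofactor matrix of a square matrix of order >= 2."""
    n = len(m)

    def cofactor(i, j):
        minor = [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]
        return (-1) ** (i + j) * det(minor)

    return [[cofactor(j, i) for j in range(n)] for i in range(n)]

# A sample symmetric matrix of order 3 (illustrative values).
S = [[2, 1, 0],
     [1, 3, 4],
     [0, 4, 5]]
A = adjugate(S)
print(det(S), det(A))                                    # det(A) should be det(S)**2
print(all(A[i][j] == A[j][i] for i in range(3) for j in range(3)))
```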

In this note we consider diophantine automorphisms charac- 
terized by this property. We assume a form f of degree k in n 
variables such that 

\[ f(X) = a\,f^{k-1}(x), \qquad X_i = \sum_{j=1}^{n} c_{ij}\,\frac{\partial f(x)}{\partial x_j}, \tag{2} \]

where a and $c_{ij}$ are numerical. We prove two results. 

(I) The transformation defined by the second of (2) is a Cre- 
mona transformation, and (but for a linear transformation) is 
of period two: 

n 
(3) X(X) = 

j=1 
This result is a sort of converse to a result of L. Weisner,† who 
shows that from a Cremona transformation of finite period a 
diophantine automorph may be deduced. 


* Loc. cit., No. 54, Note sur les hyperdéterminants, pp. 352-355. 
† This Bulletin, vol. 33 (1927), pp. 707-712. 




(II) If $f(x_1, \dots, x_n)$ is irreducible (in the field of rationals, 
say), then the Hessian of f is a constant multiplied into a power 
of f: 

\[ H[f] = \text{const} \cdot f^{\,n(k-2)/k}. \tag{4} \]

This second result shows the connection between the Cayley 
identities quoted above. From (4) we have as an immediate 
corollary that for a function satisfying (2) the degree, k, is a 
divisor of 2n, n being the number of variables. 
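
For Eisenstein's quartic (n = 4, k = 4) the exponent n(k−2)/k equals 2, so result (II) predicts that the Hessian is a numerical multiple of the square of f. The sketch below checks this at a few integer points. It assumes the usual form of the discriminant of the binary cubic, stores the polynomial as a dictionary from exponent tuples to coefficients, and differentiates it term by term; the value 432 of the constant was obtained from this normalization and is not quoted from the text.

```python
# Eisenstein's quartic as {exponents of (x1, x2, x3, x4): coefficient} (assumed form).
F = {(2, 0, 0, 2): 1, (0, 2, 2, 0): -3, (1, 0, 3, 0): 4,
     (0, 3, 0, 1): 4, (1, 1, 1, 1): -6}

def diff(poly, i):
    """Partial derivative of a polynomial with respect to variable number i."""
    out = {}
    for exps, coeff in poly.items():
        if exps[i] > 0:
            lowered = exps[:i] + (exps[i] - 1,) + exps[i + 1:]
            out[lowered] = out.get(lowered, 0) + coeff * exps[i]
    return out

def evaluate(poly, point):
    """Value of the polynomial at an integer point."""
    total = 0
    for exps, coeff in poly.items():
        term = coeff
        for x, e in zip(point, exps):
            term *= x ** e
        total += term
    return total

def det(m):
    """Determinant by Laplace expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))

def hessian_det(poly, point):
    """Determinant of the matrix of second partial derivatives at the point."""
    n = len(point)
    return det([[evaluate(diff(diff(poly, i), j), point) for j in range(n)]
                for i in range(n)])

n, k = 4, 4
print((2 * n) % k == 0, n * (k - 2) // k)          # k divides 2n; the exponent is 2
for p in [(1, 0, 0, 1), (1, 0, 1, 0), (1, 1, 1, 2), (2, -1, 3, 5)]:
    print(p, hessian_det(F, p), 432 * evaluate(F, p) ** 2)
```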


2. The Cremona Property. Let us write 


fi= » = 
Ox; j=1 
Then since 
Of; Of; OX; jar OX; 
we find that 
x of(x 
Y.(X) = = ) = a — 
(5) j dX; of; afi 
Ox; Ox; 
= = a(k — 1)f*-? 
a(k — 1)f** Dif; 
But 
Lis = Ff, 
so that 
Ox; Of dx; Ox; 
Of; Of; Ox; Of; “aed 
and 
Ox; 
(k-1) 


Substituting into (5), we find 


Y(X) = af*-?. Xi, 
and 


(3) X(X) = 


where |] 




3. Proof of (4). It is evident that 


ox 


Ox 


ox 


Ox 


From this fact, and from the equations 


av(X) 
= = H 
| ax | axax;| 
ox 0? f(x) 
= = H 


where H denotes the Hessian constructed with respect to the 
variables indicated, we find 


eH [f(X) [f(2)] = 


Ox; 


The evaluation of this determinant presents no difficulty. Let 
us use the notation ¢; =0¢/0x;, and let 6;; denote the Kronecker 
delta. Then we may write 


xP) 
| | | =x; + $8;;5| 
Ox; 

t+ Xn X2 
XnP1 XnPe2 + ? Xn 

0 0 0 —1 
o O x1 
0 Xe 
0 0 Xn 
—1 




Then we shall have 
{1,2,--- ,n} = 2, , n} — £4647 
= ¢7{3,---, n} — + x262)¢"- 
— $* — (xidit - = — (s+ 


if @ be homogeneous of degree s. Therefore 


vj 


= (k — 1)2fn(#-2), 


and substituting in (6), 
eH [f(X)H[f(x)] = ar(k — 


If now we make use of the hypothesis that $f(x_1, \dots, x_n)$ is irre- 
ducible we have immediately 

\[ H[f(x)] = \text{const} \cdot f^{\,n(k-2)/k}(x), \tag{4} \]

the exponent being found by comparing the degrees of both 
members. 


4. Forms Not Satisfying (2). It is by no means necessary that a 
form f having the automorphic property satisfy (2). This is ob- 
vious if f be composable and of degree #3. Furthermore simple 
examples exist of forms which are not composable. As an exam- 
ple we remark that 


\[ (x_1^2 + x_2^2 + x_3^2)^2 = (x_1^2 + x_2^2 - x_3^2)^2 + (2x_1x_3)^2 + (2x_2x_3)^2, \]


but, as is well known, $x_1^2 + x_2^2 + x_3^2$ is not composable. 

In another note I shall indicate the construction of a class 
of forms satisfying (1) but in general not (2); the forms are de- 
rived from invariant theory. 


CALIFORNIA INSTITUTE OF TECHNOLOGY 




ON A MATRIX DIFFERENTIAL OPERATOR* 
BY A. K. MITCHELL 


1. Introduction. H. W. Turnbull† has defined a matrix differ- 
ential operator and has given several theorems concerning its 
properties. The first of these theorems is proved by induction, 
and for the fourth, use is made of a lemma concerning the prin- 
cipal minors of a determinant for which he gives a long and 
intricate proof. It is the purpose of this note to give a simple and 
direct proof of this lemma and of the fourth theorem, and to 
indicate how each of the first three theorems proved by Turn- 
bull may be obtained directly. In the last paragraph will be 
given, in addition, the results of applying the operator suc- 
cessively to the coefficients of the characteristic equation of a 
matrix. 


2. The Matrix Differential Operator. Let 
Ei} ---£} 


E2 E?---E? 


Ef Ev Es 


be an n-rowed square matrix whose n? elements are treated as 
independent variables. This matrix is characterized by a typical 
element, and we shall write E = E,’, where r denotes the row and 
s the column. In this way if we have another matrix F= F,", we 
shall write,‡ from the law of multiplication of matrices, 
EF = E,'F?. 
We observe that 
EF = E,{F? = 
whereas 
FE=F,'E¢ = EF. 


* Presented to the Society, March 26, 1932. 

† H. W. Turnbull, On differentiating a matrix, Proceedings of the Edin- 
burgh Mathematical Society, (2), vol. 1, Part 2 (1928), pp. 111-128. 

‡ Throughout this paper repeated Greek suffixes will denote summation 
from 1 to n. 




The matrix differential operator © is defined as follows: 


O/OE} O/dE? ---d/dEs 
0/dE? ---0/d0E* 


| 0/OE} ---d/dEz 


We note that the order of suffixes is transposed from that of E. 
The effect of the operator Q is defined by the ordinary law of 
multiplication of matrices; thus QF =0F2/dE¢. For a scalar 
function f we have Of = 0f/0E,. 

3. Proof of Turnbull’s Theorems. Now let I, denote the sum of 
all the s-rowed principal minors of the determinant of the 
matrix E. The fourth theorem given by Turnbull, which we shall 
prove directly, is then as follows. 

THEOREM. 


— = 0, E — 671; + QI. = 0, 


— (— + (— 1)*07, = 0. 


The proof will depend upon the properties of the generalized 
Kronecker delta* 6%: ::% which, if the superscripts are distinct 
from each other and the subscripts are the same set of numbers 
as the superscripts, has the value +1 or —1 according as an 
even or an odd permutation is required to arrange the super- 
scripts in the same order as the subscripts; and which, in all 
other cases, has the value zero. In particular 6,7=0 if rs, 

«=1if r=s. Then, J, being the sum of all the s-rowed principal 
minors of E, we may write: 
I, = 
1 By 


1 a B, 


* See F. D. Murnaghan, American Mathematical Monthly, vol. 32 (1925), 
p. 233; and O. Veblen, Invariants of Quadratic Differential Forms, p. 3. 




from which we have immediately 
Q1, = = 


Continuing in this way we obtain by an easy calculation* 


n+l_r 


+--+ + (—1) Es 


Transposing the left members of these equations we have the 
theorem stated above. 
From the foregoing we see that 


a a t—1 a, 
OI p-1/OEq = 5,1 p-2 —1p-sE, + 
and 


p+lir 


from which it is evident that 
01,/0E, = p-1 — (OI p-1/OEx) Ea 


This is the lemma used by Turnbull. 
By induction Turnbull proves the following formula: 

\[ \Omega E^r = nE^{r-1} + s_1E^{r-2} + \cdots + s_{r-1}, \]

where $s_r$ is the sum of the rth powers of the latent roots of the 
matrix E (that is, $s_r = \lambda_1^r + \lambda_2^r + \cdots + \lambda_n^r$). 
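
The case r = 2 of this formula, Ω E² = nE + s₁ (with s₁ the sum of the diagonal elements, multiplied by the unit matrix), can be checked with integers. The sketch assumes the index convention (ΩF)[i][k] = Σ_j ∂F[j][k]/∂E[j][i], which is one reading of the transposed suffixes in the definition of Ω. It obtains each partial derivative exactly from a central difference with step 1, which is exact here because every element of E² is a polynomial of degree at most two in any single element of E. The function names are illustrative.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def omega_of_square(e):
    """Apply the matrix differential operator to E*E under the assumed convention."""
    n = len(e)
    result = [[0] * n for _ in range(n)]
    for j in range(n):
        for i in range(n):
            up = [row[:] for row in e]
            down = [row[:] for row in e]
            up[j][i] += 1
            down[j][i] -= 1
            sq_up, sq_down = matmul(up, up), matmul(down, down)
            for k in range(n):
                # Exact derivative of (E^2)[j][k] with respect to E[j][i].
                result[i][k] += (sq_up[j][k] - sq_down[j][k]) // 2
    return result

def formula_rhs(e):
    """n*E + s_1 * (unit matrix), the r = 2 case of the formula."""
    n = len(e)
    trace = sum(e[i][i] for i in range(n))
    return [[n * e[i][k] + (trace if i == k else 0) for k in range(n)]
            for i in range(n)]

E = [[1, 2], [3, 4]]
print(omega_of_square(E), formula_rhs(E))
```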

Now by a theorem due to Sylvester,† the latent roots of any 
rational function of a matrix E are the corresponding functions 
of the latent roots of E. In particular the latent roots of $E^r$ are 
$\lambda_1^r, \lambda_2^r, \dots, \lambda_n^r$. From 


= 


by differentiation, we obtain 


* See The derivation of tensors from tensor functions, American Journal of 
Mathematics, vol. 53 (1931), p. 198. 
† Sylvester, Mathematical Papers, vol. 4, p. 133. 




Bi ar 


But the sum of the latent roots of a matrix is equal to the 
sum of the elements of the leading diagonal of the matrix, so that 
we have by Sylvester’s theorem, s,=E%, 
s'= Ej, EZ - - - Er. Hence, from the above expression for QE’, 


\[ \Omega E^r = nE^{r-1} + s_1E^{r-2} + \cdots + s_{r-2}E + s_{r-1}. \]


In like manner each of the Theorems I and III of Turnbull may 
be proved directly. 


4. Conclusion. Let us now consider the effect of repeating the 
application of the operator Ω on the functions $I_1, I_2, \dots, I_n$ de- 
fined above. The I’s so defined are the coefficients of the char- 
acteristic equation* of the matrix E. From the definition of $I_p$ 
and of Ω it is readily seen that 

0°71, = [1/(p — Ea, Ear 
= — [(p — 1)/(p — 1) 
= — [(n— p+ 1)!/(m — 5.4, 
and 
077 5-1 — (n = p + 2) QI p-2. 
Therefore 
27, = [(n — p + 2)!/(m — 


p= (— — — 
= (— 1)?#[(n — 1)!/(m — 
When this becomes 
271, = (— — 1) 


Thus if the operator Ω is applied n times to the n-rowed deter- 
minant of E we obtain $(-1)^{n+1}(n-1)!$ times the unit matrix. 


TRINITY COLLEGE 


* See A note on the characteristic equation of a matrix, American Mathe- 
matical Monthly, vol. 38 (1931), p. 386. 




SOLUTIONS OF BOUNDED VARIATION OF THE 
FREDHOLM-STIELTJES INTEGRAL EQUATION* 


BY L. J. PARADISO 


The purpose of this note is to give conditions under which 
the Stieltjes integral equation 

\[ \phi(x) = f(x) + \lambda \int_a^b K(x, y)\, d\phi(y) \tag{1} \]

has a solution φ(x) of bounded variation. 

In the first theorem conditions on f(x) and K(x, y) are given 
under which the method of successive substitutions yields a 
solution of bounded variation for a limited range of values of λ. 

With further restrictions on K(x, y), it is shown in the second 
theorem that the Fredholm method applies and the solution of 
bounded variation thus obtained is valid for all values of λ ex- 
cept for the characteristic values. 

Finally, an example is given to show that the more restrictive 
conditions on K(x, y) given in the second theorem are not suffi- 
cient to make the problem a special case of that treated by 
Riesz.† 

THEOREM 1. If 

(a) f(x) is of bounded variation, $a \le x \le b$, 

(b) K(x, y), defined and bounded on R ($a \le x \le b$, $a \le y \le b$), is 
continuous in y for each x and has a total variation in x for each 
y, $T_K(y)$, which is a bounded function of y having the least upper 
bound $T_K$, and 

(c) $|\lambda| < 1/T_K$, 


then the function φ(x) defined by the series 

\[ \phi(x) = f(x) + \lambda \int_a^b K(x, y_1)\, df(y_1) + \lambda^2 \int_a^b K(x, y_1)\, d_{y_1}\!\int_a^b K(y_1, y_2)\, df(y_2) + \cdots \tag{2} \]

is the unique solution of bounded variation of integral equation (1). 


* Presented to the Society, December 31, 1930. 
† F. Riesz, Über lineare Funktionalgleichungen, Acta Mathematica, vol. 41 
(1918), pp. 71-98. 




This theorem is proved by the usual method of successive 
substitutions. Let the function ψ(x) be defined by 

\[ \psi(x) = \int_a^b K(x, y)\, df(y). \]

Designate the total variations of f(x) and ψ(x) by $T_f$ and $T_\psi$, re- 
spectively, and let $M \ge |K(x, y)|$. Then we have 

\[ |\psi(x)| \le M\,T_f \quad \text{and} \quad T_\psi \le T_K \cdot T_f. \]


With the use of these inequalities it is found that series (2) 
obtained from equation (1) by repeated substitution converges 
absolutely and uniformly in x for all values of λ satisfying the 
inequality $|\lambda| < 1/T_K$. The function thus defined, φ(x), is found 
by substitution to satisfy equation (1). That it is the unique 
solution follows as a consequence of the fact that the method of 
successive substitutions when applied to the homogeneous 
equation 

\[ \phi(x) = \lambda \int_a^b K(x, y)\, d\phi(y) \]

yields as its only solution φ(x) = 0. 

THEOREM 2. If f(x) is of bounded variation in the interval 
(a, b) and K(x, y) together with $\partial K(x, y)/\partial x$ are continuous func- 
tions of x and y in R, then there exists a unique solution φ(x) of 
bounded variation of equation (1) for all values of λ except for 
the characteristic values. 

This theorem can be proved by applying Fredholm’s method 
to equation (1). However, we shall employ the following trans- 
formation* which reduces the problem to the solution of a 
Riemann integral equation. Let θ(x) = φ(x) − f(x). Then equation 
(1) becomes 

\[ \theta(x) = \lambda \int_a^b K(x, y)\, df(y) + \lambda \int_a^b K(x, y)\, d\theta(y). \tag{3} \]

It is easily verified, with the given hypotheses on the func- 
tions involved, that each term of the right hand side of equation 
(3) is a continuous function of x and possesses a continuous 


* J. D. Tamarkin suggested to me the possibility of transforming equation 
(1) into a Riemann integral equation. 


derivative. Hence on placing $\lambda \int_a^b K(x, y)\, df(y) = F(x)$, we obtain 
from equation (3) by differentiation 

\[ \theta'(x) = F'(x) + \lambda \int_a^b \frac{\partial K(x, y)}{\partial x}\, \theta'(y)\, dy. \tag{4} \]

The function F′(x) is continuous, and, moreover, by hypothesis 
$\partial K(x, y)/\partial x$ is continuous in x and y. Hence the Riemann integral 
equation (4) has a unique continuous solution θ′(x) for all 
values of λ except for the characteristic values. The unique 
solution of bounded variation φ(x) of equation (1) can thus be 
found. 

An example. Let ||f|| denote the maximum of the absolute 
value of the continuous function f(x) in (a, b). One of the con- 
ditions on the transformation 

\[ T[f] = \int_a^b K(x, y)\, df(y) \]

as given by F. Riesz* is that there exists a constant M such 
that for all continuous functions f(x) 

\[ \| T[f] \| \le M\, \| f \|. \tag{5} \]


The function K(x, y) defined in the following example satis- 
fies the conditions of Theorem 2 whereas inequality (5) given 
by Riesz is not satisfied by the transformation T[f]. We define 
K(x, y) to be a function of one variable, thus: 

\[ K(x, 0) = 0, \qquad K(x, y) = y \sin(\pi/y), \quad (0 < y \le 1). \]

We next define a sequence of continuous functions $f_n(y)$ bounded 
in n and y. The nth member of this sequence is a function whose 
graph consists of a series of broken lines. These lines have a 
slope equal to zero in the interval (0, 1/(n+1)), while in the in- 
terval (1/(n+1), 1) they have a negative slope where the func- 
tion $y \sin(\pi/y)$ is negative and a positive slope where this 
function is positive. Let $\delta_i$ denote 1 or 0 according as i is odd or 
even. Then 

\[ f_n(y) = (-1)^k k(k+1)\,y + (-1)^{k+1} k + \delta_k, \qquad 1/(k+1) \le y \le 1/k, \quad (k = 1, \dots, n), \]


* F. Riesz, loc. cit., p. 72. 




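and 

\[ f_n(y) = \delta_n, \qquad (0 \le y \le 1/(n+1)). \]

From this definition we have 

\[ \frac{df_n}{dy} = (-1)^k k(k+1), \quad (1/(k+1) < y < 1/k,\ k = 1, 2, \dots, n); \qquad \frac{df_n}{dy} = 0, \quad (0 < y < 1/(n+1)). \tag{6} \]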

The transformation 

\[ T[f_n] = \int_0^1 K(x, y)\, df_n(y) \]

becomes from the definition of $f_n(y)$ and from (6) 

\[ T[f_n] = \sum_{k=1}^{n} \int_{1/(k+1)}^{1/k} y \sin(\pi/y)\, \frac{df_n}{dy}\, dy = \sum_{k=1}^{n} k(k+1) \int_{1/(k+1)}^{1/k} (-1)^k y \sin(\pi/y)\, dy. \tag{7} \]

The integral $\int_{1/(k+1)}^{1/k} y \sin(\pi/y)\, dy$ is in absolute value greater than the 
area of the triangle whose vertices are at (1/(k+1), 0), (1/k, 0), 
$[2/(2k+1),\ (-1)^k\, 2/(2k+1)]$. Since the area of this triangle is 
1/[k(k+1)(2k+1)], we have 

\[ \int_{1/(k+1)}^{1/k} (-1)^k y \sin(\pi/y)\, dy > \frac{1}{k(k+1)(2k+1)}. \]

Consequently we get from (7) 

\[ T[f_n] > \sum_{k=1}^{n} \frac{k(k+1)}{k(k+1)(2k+1)} = \sum_{k=1}^{n} \frac{1}{2k+1}, \]

from which it follows that $T[f_n]$ becomes infinite with n. Hence 
condition (5) is not satisfied and equation (1) is not a special 
case of that treated by F. Riesz. 
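
The growth of T[f_n] can be observed numerically. The sketch below assumes the kernel K(x, y) = y sin(π/y) on (0, 1] and the slopes (−1)^k k(k+1) of f_n on (1/(k+1), 1/k), as read from the damaged formulas above, and approximates each integral in (7) by a midpoint rule. The function names and the number of subdivisions are illustrative choices, and the printed values are approximations only.

```python
import math

def kernel(y):
    """K(x, y) = y*sin(pi/y) for y > 0, and 0 at y = 0 (independent of x)."""
    return y * math.sin(math.pi / y) if y > 0 else 0.0

def transform_of_f_n(n, steps=2000):
    """Midpoint-rule approximation of T[f_n] as written in (7)."""
    total = 0.0
    for k in range(1, n + 1):
        lo, hi = 1.0 / (k + 1), 1.0 / k
        h = (hi - lo) / steps
        piece = sum(kernel(lo + (i + 0.5) * h) for i in range(steps)) * h
        total += (-1) ** k * k * (k + 1) * piece   # slope of f_n on this interval
    return total

def triangle_lower_bound(n):
    """The lower bound 1/3 + 1/5 + ... + 1/(2n+1) obtained from the triangles."""
    return sum(1.0 / (2 * k + 1) for k in range(1, n + 1))

for n in (1, 5, 50):
    print(n, transform_of_f_n(n), triangle_lower_bound(n))
```

The functions f_n stay between 0 and 1 for every n, while both printed columns keep growing with n, which is the failure of a bound of the form (5).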


CORNELL UNIVERSITY 




ON REAL SYMMETRIC DETERMINANTS WHOSE 
PRINCIPAL DIAGONAL ELEMENTS ARE ZERO 


BY W. V. PARKER 


A paper by L. M. Blumenthal in this Bulletin* is concerned 
with the following theorem. 
If the symmetric determinant 


(rij 


ll 
So 
o 


with $r_{ij} > 0$, $(i, j = 1, 2, 3, 4)$, $i \ne j$, is different from zero, and the 
complementary minors of four of the elements in the principal 
diagonal vanish, then the complementary minor of the remaining 
element does not vanish. 

The purpose of this note is to establish a more general the- 
orem from which the above is immediately obtainable. This 
theorem may be stated as follows. 

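THEOREM 1. If the real symmetric determinant, $D = |a_{ij}|$, of 
order five with $a_{ii} = 0$, $(i = 1, 2, 3, 4, 5)$, has all five of its fourth- 
order principal minors zero, then D vanishes. 

Since the elements of D are real, we may write 

\[ D = \begin{vmatrix} 0 & a^2 & b^2 & c^2 & d^2 \\ a^2 & 0 & x^2 & y^2 & z^2 \\ b^2 & x^2 & 0 & u^2 & v^2 \\ c^2 & y^2 & u^2 & 0 & w^2 \\ d^2 & z^2 & v^2 & w^2 & 0 \end{vmatrix}, \]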


where a, b, c, d, x, y, z, u, v, w are either real or pure imaginary num- 
bers. The theorem is trivial if all elements of the first row are 
zero. We will suppose then that at least one element is different 
from zero and without loss of generality we may assume $a \ne 0$. 


* This Bulletin, vol. 37 (1931), pp. 752-758. 




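If now we subtract $b^2/a^2$ times the second column from the 
third, $c^2/a^2$ times the second column from the fourth, and $d^2/a^2$ 
times the second column from the fifth, and then transform the 
rows in the same way, D assumes the form 

\[ D = -\frac{1}{a^2} \begin{vmatrix} -2b^2x^2 & a^2u^2 - b^2y^2 - c^2x^2 & a^2v^2 - b^2z^2 - d^2x^2 \\ a^2u^2 - b^2y^2 - c^2x^2 & -2c^2y^2 & a^2w^2 - c^2z^2 - d^2y^2 \\ a^2v^2 - b^2z^2 - d^2x^2 & a^2w^2 - c^2z^2 - d^2y^2 & -2d^2z^2 \end{vmatrix}. \tag{1} \]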

If we denote the fourth-order principal minors by $M_1$, $M_2$, 
$M_3$, $M_4$, $M_5$, we have 

\[
\begin{aligned}
M_1 &= (xw + yv + zu)(xw + yv - zu)(xw - yv + zu)(xw - yv - zu),\\
M_2 &= (bw + du + cv)(bw + du - cv)(bw - du + cv)(bw - du - cv),\\
M_3 &= (aw + dy + cz)(aw + dy - cz)(aw - dy + cz)(aw - dy - cz),\\
M_4 &= (av + dx + bz)(av + dx - bz)(av - dx + bz)(av - dx - bz),\\
M_5 &= (au + cx + by)(au + cx - by)(au - cx + by)(au - cx - by).
\end{aligned}
\]

If $M_3 = M_4 = M_5 = 0$, then $a^2u^2 = (cx \pm by)^2$, $a^2v^2 = (dx \pm bz)^2$, and 
$a^2w^2 = (dy \pm cz)^2$. Substituting these values in (1) we get 
\[ D = -\frac{8b^2c^2d^2x^2y^2z^2}{a^2} \begin{vmatrix} -1 & \pm 1_u & \pm 1_v \\ \pm 1_u & -1 & \pm 1_w \\ \pm 1_v & \pm 1_w & -1 \end{vmatrix} = 2\left[(\pm 1_u)(\pm 1_v)(\pm 1_w) + 1\right](-8b^2c^2d^2x^2y^2z^2)/a^2, \tag{2} \]

where $\pm 1_u$ is +1 or −1 according as $a^2u^2 = (cx + by)^2$ or $a^2u^2 
= (cx - by)^2$, and similarly for $\pm 1_v$ and $\pm 1_w$. From this form 
it is evident that if b, c, d, x, y, or z is zero, D is zero and hence 
our theorem is true for this case. Let us assume now that b, c, d, 
x, y, z are all different from zero. Then in order that D be different 
from zero it is necessary that the above signs be taken all plus 
or one plus and two minus. Since $M_1$ and $M_2$ are even functions of 
u, v, w, we may take only positive square roots on both sides. 
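
The factored forms of the principal minors and formula (2) can be spot-checked with exact rational arithmetic. The sketch below assumes the arrangement of D with first row (0, a², b², c², d²) and the remaining off-diagonal elements x², y², z², u², v², w² placed so that deleting the first row and column leaves the variables x, y, z, u, v, w. For the case of three plus signs it compares D with the value −32 b²c²d²x²y²z²/a² that (2) gives. The function names and the sample numbers are illustrative.

```python
from fractions import Fraction

def det(m):
    """Determinant by Laplace expansion along the first row (fine for order <= 5)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        if entry == 0:
            continue
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def build_d(a, b, c, d, x, y, z, u, v, w):
    """The symmetric zero-diagonal matrix of order five whose elements are squares."""
    sq = lambda t: t * t
    return [[0, sq(a), sq(b), sq(c), sq(d)],
            [sq(a), 0, sq(x), sq(y), sq(z)],
            [sq(b), sq(x), 0, sq(u), sq(v)],
            [sq(c), sq(y), sq(u), 0, sq(w)],
            [sq(d), sq(z), sq(v), sq(w), 0]]

def principal_minor(m, index):
    """Fourth-order principal minor obtained by deleting row and column `index`."""
    keep = [i for i in range(len(m)) if i != index]
    return det([[m[i][j] for j in keep] for i in keep])

def all_plus_case(a, b, c, d, x, y, z):
    """Choose u, v, w so that au = cx+by, av = dx+bz, aw = dy+cz."""
    a, b, c, d, x, y, z = map(Fraction, (a, b, c, d, x, y, z))
    u, v, w = (c * x + b * y) / a, (d * x + b * z) / a, (d * y + c * z) / a
    return build_d(a, b, c, d, x, y, z, u, v, w)

# Factorization of M_1 at arbitrary integer values.
a, b, c, d, x, y, z, u, v, w = range(1, 11)
m = build_d(a, b, c, d, x, y, z, u, v, w)
p, q, r = x * w, y * v, z * u
print(principal_minor(m, 0) == (p + q + r) * (p + q - r) * (p - q + r) * (p - q - r))

# Formula (2) with all three signs plus.
n = all_plus_case(2, 1, 3, 1, 1, 2, 1)
print(det(n), [principal_minor(n, i) for i in (2, 3, 4)])
```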

CASE I. The three signs plus. In this case au = cx + by, av = dx 
+ bz, aw = dy + cz. Substituting these values in the expressions 
for $M_1$ and $M_2$ above we have 

\[
\begin{aligned}
M_1 &= -(dxy + cxz + byz)(16\,bcd\,x^2y^2z^2)/a^4,\\
M_2 &= -(cdx + bdy + bcz)(16\,b^2c^2d^2\,xyz)/a^4.
\end{aligned}
\]

Since by supposition b, c, d, x, y, z are all different from zero, in 
order that $M_1 = M_2 = 0$ it is necessary that dxy + cxz + byz = 0 
and cdx + bdy + bcz = 0, from which it follows that $c^2x^2 + bcxy 
+ b^2y^2 = 0$. That is, cx/(by) must be a complex cube root of unity, 
but this is impossible since cx/(by) is either real or a pure imag- 
inary number. Hence the theorem is true for this case. 

CASE II. One sign plus and two minus. There are three possi- 
bilities here. We will carry the three through simultaneously 
indicating the corresponding steps by (1), (2), (3) for each case: 

\[
\begin{array}{llll}
(1) & au = cx + by, & av = dx - bz, & aw = dy - cz;\\
(2) & au = cx - by, & av = dx + bz, & aw = dy - cz;\\
(3) & au = cx - by, & av = dx - bz, & aw = dy + cz.
\end{array}
\]


The corresponding expressions for $M_1$ and $M_2$ are 

\[
\begin{array}{ll}
(1) & M_1 = -(dxy - cxz - byz)(16\,bcd\,x^2y^2z^2)/a^4,\\
(2) & M_1 = -(cxz - dxy - byz)(16\,bcd\,x^2y^2z^2)/a^4,\\
(3) & M_1 = -(byz - dxy - cxz)(16\,bcd\,x^2y^2z^2)/a^4,\\
(1) & M_2 = -(bcz - bdy - cdx)(16\,b^2c^2d^2\,xyz)/a^4,\\
(2) & M_2 = -(bdy - bcz - cdx)(16\,b^2c^2d^2\,xyz)/a^4,\\
(3) & M_2 = -(cdx - bcz - bdy)(16\,b^2c^2d^2\,xyz)/a^4.
\end{array}
\]

Since b, c, d, x, y, z are all different from zero, in order that $M_1 
= M_2 = 0$ it is necessary that 

\[
\begin{array}{ll}
(1) & c^2x^2 + bcxy + b^2y^2 = 0,\\
(2) & d^2x^2 + bdxz + b^2z^2 = 0,\\
(3) & d^2y^2 + cdyz + c^2z^2 = 0.
\end{array}
\]

None of these is possible for the same reason as in Case I. 

We have shown therefore that if $M_3 = M_4 = M_5 = 0$ and $D \ne 0$, 
then not both $M_1$ and $M_2$ are zero. Hence, if $M_1 = M_2 = M_3 
= M_4 = M_5 = 0$, D vanishes. 

If the elements outside the principal diagonal of D are all dif- 
ferent from zero, we see from (2) that if $M_3 = M_4 = M_5 = 0$ then 
the signs must be taken all minus or two plus and one minus in 
order for D to vanish. In each of these cases $M_1 = M_2 = 0$. We 
may state, in this case, therefore the following corollary. 

COROLLARY. If four of the principal minors of D are zero and 
the fifth is not zero then D is not zero. 

As an immediate consequence of Theorem 1 we have also the 
following result. 


THEOREM 2. If $D = |a_{ij}|$ is a real symmetric determinant of or- 
der greater than four with $a_{ii} = 0$, then if all fourth-order principal 
minors of D are zero, D vanishes. 

Any fifth-order principal minor of D is of the form of the 
determinant under Theorem 1 and hence is zero if all fourth- 
order principal minors vanish. Hence all principal minors of 
orders four and five are zero and therefore the rank of D is three 
or less.* 


THEOREM 3. If $D = |a_{ij}|$ is a real symmetric determinant of or- 
der n, n > 5, with $a_{ii} = 0$, and M is any principal minor of D of 
order n−1, then if all fourth-order principal minors of M are zero, 
D vanishes. 

By Theorem 2, the rank of M is three or less. We may suppose 
now that M is the minor in the upper left hand corner of D. 
When D is expanded according to the product of the elements of 
its last row and last column it is expressed as a sum of products 
of these elements by minors of M of order n−2.† But all minors 
of M of order n—2 are zero and hence D vanishes. 


MISSISSIPPI WOMAN’S COLLEGE 


* Bôcher, Introduction to Higher Algebra, p. 57, Theorem 2. 
† Bôcher, loc. cit., p. 29, Theorem 3. 




ON THE DEGREE OF GENERALITY OF A CLASS 
OF ARITHMETICAL IDENTITIES 


BY E. T. BELL 


1. Introduction. In a recent paper, A class of arithmetical iden- 
tities,* Oppenheim, in commenting on a special case (n not 
square) of the identity A below, which is due to Uspensky,† 
states that the conditions on the function H are “needlessly re- 
strictive,” and he gives a “general identity which will include 
such identities as these,” (A, with n not square, and the new 
identities of Uspensky which involve incomplete numerical func- 
tions).‡ It is indeed readily seen that Oppenheim’s identity B 
implies all of those of Uspensky. In view of the remarks quoted, 
it is perhaps in order to give a detailed proof, by general methods 
adapted to the specific functions and partitions in A, B, that 
Uspensky’s A alone implies Oppenheim’s B. The identities A, B 
are thus of precisely the same degree of generality. 


2. The Identities. Each of the functions H(w, u, v), F(w, u, v), 
φ(w, u, v) is finite and single-valued for integer values of w, u, v, 
and beyond this restriction and those stated in (1), (4), (5) for H, F, 
φ, respectively, the functions are entirely arbitrary. 


A. (USPENSKY). Let 
(1) H(w, u, v) = − H(− w, u, v) = H(w, − u, − v); H(0, u, v) = 0. 
Then, if the Σ on the left of (3) refers to all integers x, y, z such that 
(2) y>O0, z>0, 


where n is an arbitrary constant integer, 


* Quarterly Journal of Mathematics, Oxford Series, vol. 2, No. 7, 1931, pp. 
300-303. 

† First published, apparently, in Kharkof Mathematical Society Communi- 
cations, (2), vol. 15 (1915-1917), pp. 81-147, (Russian). See Jahrbuch über die 
Fortschritte der Mathematik, vol. 48, p. 1350. 

‡ This Bulletin, vol. 36 (1930), pp. 743-754. Oppenheim, loc. cit., p. 230, 
corrects the misprints. The considerations of the present note apply also to the 
identities of Mordell, which Oppenheim cites. 


(3) $\sum \bigl[ H(y+z,\ z-y,\ x) - 2H(z-2x,\ 2x+2y-z,\ x+y) \bigr]$

$\qquad = \epsilon(n) \sum_{j=1}^{2m-1} \bigl[ H(2m,\ 2m-2j,\ m-j) - 2H(2m-j,\ 2m-j,\ m) \bigr],$

where ε(n) = 1 or 0 according as n is or is not square, and m = |√n|.
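
The identity A lends itself to a direct numerical check. The following is only a sketch under stated assumptions: the partition (2) is read as n = x² + yz with y > 0, z > 0, the left side of (3) is read as Σ[H(y+z, z−y, x) − 2H(z−2x, 2x+2y−z, x+y)], and the sample functions in SAMPLE_H are illustrative choices satisfying (1), not functions taken from the paper.

```python
from math import isqrt

def lhs(H, n):
    """Left side of (3): sum over integers x, y, z with n = x*x + y*z, y > 0, z > 0."""
    total = 0
    for x in range(-isqrt(n), isqrt(n) + 1):
        rest = n - x * x                      # rest must factor as y*z
        for y in range(1, rest + 1):
            if rest % y == 0:
                z = rest // y
                total += H(y + z, z - y, x) - 2 * H(z - 2*x, 2*x + 2*y - z, x + y)
    return total

def rhs(H, n):
    """Right side of (3): vanishes unless n is a square m*m."""
    m = isqrt(n)
    if m * m != n:
        return 0
    return sum(H(2*m, 2*m - 2*j, m - j) - 2 * H(2*m - j, 2*m - j, m)
               for j in range(1, 2 * m))

# Illustrative functions obeying (1): odd in w, unchanged by (u, v) -> (-u, -v).
SAMPLE_H = [
    lambda w, u, v: w,
    lambda w, u, v: w * u * u,
    lambda w, u, v: w * u * v,
    lambda w, u, v: w**3 * (u*u + 3*v*v),
]

def check(limit=40):
    """Compare both sides of (3) for every sample H and every n below limit."""
    return all(lhs(H, n) == rhs(H, n) for H in SAMPLE_H for n in range(1, limit))
```

For H(w, u, v) = wu² and n = 4 both sides come to −40, which can be confirmed by hand from the seven solutions of 4 = x² + yz.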
B. (OPPENHEIM). Let 


(4) F(w, u, v) + F(w, − u, − v)


+ F(− w, u, v) + F(− w, − u, − v) = 0,


(5) φ(w, u, v) = φ(− w, u, v) = φ(w, − u, − v).
Then 

(6) 9], 
where Σ on the left refers to all integers x, y, z such that


(7) φ(y + z, z − y, x) = N, y > 0, z > 0, μ(z − y) ≠ νx,


where N is an arbitrary (real or complex) constant number, μ, ν
are any real constant numbers, and Σ on the right refers to all
integers ξ, η, ζ such that


(8) =N, OF E+n =Omod2, wn > 


If φ = N has an infinity of solutions, F is assumed to be such that
the infinite series concerned are absolutely convergent.

In a previous note* I showed that A implies and is implied by 
the identity 


a3 bs C3 
a3 bs c3| = 0, 
be C2 


= 02x —y— 22), c; = 0,(y). 


The connection of this and the present note with the series† on
Quadratic partitions in course of publication in this Bulletin is
indicated below, in §4.


3. Implication of B by A. By the definition of H, we may re-
place H in (3) by H₁, where H₁(w, u, v) is any function which


* This Bulletin, vol. 32 (1926), pp. 682-688, especially p. 685.
† See, for example, this Bulletin, vol. 37, pp. 870-875.


264 E. T. BELL [April,




is single-valued and finite for integer values of w, u,v, and which 
satisfies the parity conditions (1). We shall take 


(9) H₁(w, u, v) = Φ_N(w, u, v) V_{μ,ν}(u, v) H(w, u, v),


where Φ_N(w, u, v) = 1 or 0 according as φ(w, u, v) = N or
φ(w, u, v) ≠ N (N as in B), and where V_{μ,ν}(u, v) = 1 or 0 accord-
ing as μu − νv ≠ 0 or μu − νv = 0. This choice is permissible, since
Φ_N(w, u, v) = Φ_N(− w, u, v) = Φ_N(w, − u, − v),

as is evident from the definition of the functions. Having made
the above substitution in A, we then replace H(w, u, v) in the
resulting identity (or at once in (9), before making the sub-
stitution), by its instance F(w, u, v) + F(w, −u, −v), where F is
as in (4). Thus A implies


(10) $\sum_1 [F(y+z,\ z-y,\ x) + F(y+z,\ y-z,\ -x)]$

$\quad - 2\sum_2 [F(z-2x,\ 2x+2y-z,\ x+y) + F(z-2x,\ -2x-2y+z,\ -x-y)]$

$\quad = \epsilon(n) \sum_3 [F(2m,\ 2m-2j,\ m-j) + F(2m,\ -2m+2j,\ -m+j)]$

$\quad - 2\epsilon(n) \sum_4 [F(2m-j,\ 2m-j,\ m) + F(2m-j,\ -2m+j,\ -m)],$


where the limits j = 1 to j = 2m − 1 are to be supplied to Σ₃, Σ₄,
and Σᵢ refers in an obvious way to all the simultaneous integer
solutions of (2) and (11.i),
(11.1) φ(y + z, z − y, x) = N, μ(z − y) ≠ νx;
(11.2) φ(z − 2x, 2x + 2y − z, x + y) = N, μ(2x + 2y − z) ≠ ν(x + y);
(11.3) φ(2m, 2m − 2j, m − j) = N, μ(2m − 2j) ≠ ν(m − j);
(11.4) φ(2m − j, 2m − j, m) = N, μ(2m − j) ≠ νm.
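
The replacement of H by F(w, u, v) + F(w, −u, −v) is permissible only because that combination satisfies (1) whenever F satisfies (4). A minimal sketch of this check follows; the helper names make_F and make_H, and the way an arbitrary G is turned into an F obeying (4), are assumptions introduced here for illustration and do not come from the paper.

```python
def make_F(G):
    """Build from an arbitrary integer-valued G a function F satisfying (4):
    F(w,u,v) + F(w,-u,-v) + F(-w,u,v) + F(-w,-u,-v) = 0."""
    def F(w, u, v):
        return 3 * G(w, u, v) - G(w, -u, -v) - G(-w, u, v) - G(-w, -u, -v)
    return F

def make_H(F):
    """The instance of H used in passing from (3) to (10)."""
    return lambda w, u, v: F(w, u, v) + F(w, -u, -v)

def satisfies_1(H, bound=4):
    """Check H(w,u,v) = -H(-w,u,v) = H(w,-u,-v) and H(0,u,v) = 0 on a grid."""
    rng = range(-bound, bound + 1)
    for w in rng:
        for u in rng:
            for v in rng:
                if H(w, u, v) != -H(-w, u, v) or H(w, u, v) != H(w, -u, -v):
                    return False
                if H(0, u, v) != 0:
                    return False
    return True

# An arbitrary test function with no symmetry of its own.
G = lambda w, u, v: 7 + w + 2*u + 3*v + w*u + 5*w*w*v + u**3
H = make_H(make_F(G))
```

The design point is that (4) is exactly the condition making the antisymmetry in w automatic, while the symmetry under (u, v) → (−u, −v) is built into the sum of the two terms.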

By (2), (5), (11.1) the Σ₁ term in (10) is equal to
(12) 2 Σ₁ F(y + z, z − y, x).




By (5), if (w, u, v) = (W, U, V) is an integer solution of
φ(w, u, v) = N, then so also is (aW, bU, bV), where each of a, b is
either of 1, −1, and for any definite (a, b) as (w, u, v) runs over
all solutions, so also does (aW, bU, bV). Apply this to (11.3),
(11.4) with (a, b) = (−1, 1). Then Σᵢ in (10), for i = 3, 4, is half
what Σᵢ becomes when F(w, u, v) is replaced by F(w, u, v)
+ F(−w, u, v). Hence, by (4),


(13) $\sum_3 = \sum_4 = 0.$


Consider now the set S of all solutions (x, y, z) of (7), and let
S_n be that subset of S which contains all those solutions, and
only those, which satisfy (2). Then S_p, S_q are mutually exclusive
if p ≠ q, and S is the logical sum of S_n (n = 1, 2, ···). Any partic-
ular S_n may be the null set. In (10) take n = 1, 2, ···, and add
corresponding members of all the resulting identities. Then,
since (10) is subject to (2), (11.1)-(11.4), it follows by the
above and (12), (13) that A implies

(14) $\sum F(y+z,\ z-y,\ x)$

$\quad = \sum [F(z-2x,\ 2x+2y-z,\ x+y) + F(z-2x,\ -2x-2y+z,\ -x-y)],$

where Σ in the first line refers to (7), and Σ in the second line
to (11.2) with the additional conditions y > 0, z > 0.

To reduce (14) to the form (B), we make the substitution
(15) (z − 2x, 2x + 2y − z, x + y) = (aξ, bη, bζ),
where each of a, b is a definite one of 1, −1. Then
(16) (2x, 2y, z) = (2bζ − aξ − bη, aξ + bη, 2bζ − bη).

By the derivation of (14), it is sufficient to consider (14) sub-
ject to (2). Since y, z are integers and y > 0, z > 0, we see by (16)
that the right member of (14) in the case considered is equal to


(17) $\sum [F(a\xi,\ b\eta,\ b\zeta) + F(a\xi,\ -b\eta,\ -b\zeta)],$

the Σ referring to all integer solutions ξ, η, ζ of φ(ξ, η, ζ) = N
which are such that

(18) n = (bζ)² + ¼(aξ + bη)(aξ − bη),

(18.1) aξ + bη ≡ 0 mod 2, (18.3) 2bζ − bη > 0,
(18.2) aξ + bη > 0, (18.4) b(μη − νζ) ≠ 0,
for any definite (a, b) as defined.
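
The passage from (15) to (16) and (18) is elementary algebra and can be spot-checked by machine. The sketch below assumes the readings (z − 2x, 2x + 2y − z, x + y) = (aξ, bη, bζ) for (15) and n = x² + yz for (2); the function name and the search bound are illustrative only.

```python
def check_substitution(bound=6):
    """For every sign pair (a, b) and every admissible (xi, eta, zeta) in a box,
    recover (x, y, z) from (16) and confirm (15) and (18)."""
    rng = range(-bound, bound + 1)
    for a in (1, -1):
        for b in (1, -1):
            for xi in rng:
                for eta in rng:
                    if (a * xi + b * eta) % 2:      # (18.1): a*xi + b*eta must be even
                        continue
                    for zeta in rng:
                        x = (2 * b * zeta - a * xi - b * eta) // 2
                        y = (a * xi + b * eta) // 2
                        z = 2 * b * zeta - b * eta
                        # (15): the substitution is inverted correctly
                        if (z - 2*x, 2*x + 2*y - z, x + y) != (a*xi, b*eta, b*zeta):
                            return False
                        # (18), cleared of the fraction 1/4
                        if 4 * (x*x + y*z) != 4*zeta*zeta + xi*xi - eta*eta:
                            return False
    return True
```

Note that the conditions y > 0 and z > 0 translate into (18.2) and (18.3) respectively, since y = (aξ + bη)/2 and z = 2bζ − bη.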




Let us now replace (18.4) by its equivalent
(19) b(μη − νζ) > 0 or b(μη − νζ) < 0.
In the first of (19) take b = 1; in the second b = −1. Then the
sum (17) in the case being considered is half the same sum over
all integers ξ, η, ζ such that φ(ξ, η, ζ) = N, subject to the sets of
conditions (in the two columns)
(20) n = (+ζ)² + ¼(aξ + η)(aξ − η), n = (−ζ)² + ¼(aξ − η)(aξ + η),
(20.1) aξ − η ≡ 0 mod 2,
(20.2) aξ + η > 0, aξ − η > 0,
(20.3) 2ζ − η > 0, 2ζ − η < 0,
(20.4) μη − νζ > 0, μη − νζ > 0.


The conditions (20.3) may be suppressed, since the one miss-
ing possibility, 2ζ − η = 0, contributes zero to (17). This is seen
as follows. If 2ζ = η, then by (20.1), ξ is even; by (20), n is square,
aξ = ±2m (m as in A). Choose a = 1, or a = −1, according as
ξ > 0, or < 0. Adding half the respective contributions to (17),
we see by (4) that the total contribution vanishes.

Combining all results so far, taking a = 1 in both columns of
(20)-(20.4), and summing over both for all values of n, we see
that A implies B with the exception that the condition ξ + η ≠ 0
in (8) is here replaced by ξ + η > 0. To complete the proof that A
implies B it is therefore sufficient to show that if ξ + η ≠ 0 be
replaced by ξ + η < 0 in (8), then the right of (6) vanishes. This
follows by (4), since the conditions —§ > 7, +7 <0, un > vf 
are invariant under the substitution (£’, n’, ¢’)=(—&, 7, 0), 
where ξ, η, ζ are subject to the conditions (8) with ξ + η ≠ 0 re-
placed by ξ + η < 0, and since (ξ′, η′, ζ′) runs over all solutions
of φ(ξ, η, ζ) = N when (ξ, η, ζ) does, by (5).

4. Parity Transformations. The equivalence of A, B is a simple 
instance of the invariance of a parity identity under a parity 
transformation.* Such a transformation leaves invariant the 
parity of the functions in the identity, or that of the given parti- 
tion, or both. In this instance both are used; in (9) and in pass- 
ing to (10) the transformation is on the function; in substituting 


* For some account of these, see E.T. Bell, Algebraic Arithmetic, Colloquium 
Publications of this Society, vol. 7, 1927, and earlier papers in the Transactions 
of this Society, where such transformations were applied in simple cases to the 
arithmetic of higher forms. 




o(w, u, v) for w®+uv, in which $(w, u, v) and w*+wuv are of the 
same parity, it is on the partition. 

If we define (as seems reasonable) the proposition P to be a 
generalization of the proposition Q when and only when P im- 
plies Q, and Q does not imply P, it can be proved that it is im- 
possible to generalize a parity identity by parity transforma- 
tions.* For this reason, in the notes appearing currently in this 
Bulletin on Quadratic partitions, the discussion is limited to such 
partitions and to parity functions which, so far as feasible, are 
unrestricted by conditions which can be introduced by parity 
transformations. The passage from 1 to N also is a parity trans- 
formation, since any not identically zero linear homogeneous 
function of functions having a given parity has the same parity. 

The linear transformation (15) would be suggested naturally 
as a possible means of transforming A by that which is given 
by the addition theorem for the thetas used in deducing A from 
the identity stated in §2, end. All such transformations go back 
ultimately to the orthogonal one on four variables which makes 
possible Jacobi’s theta formula. It can be shown (and will be 
discussed in another note) from theorems of Dickson† on certain
automorphs of sums of like powers, higher than the second, that
parity identities referring to partitions of degree higher than 2
not obtainable by parity transformations from quadratic parti-
tions do not exist.

Finally it may be noticed that convergence is irrelevant if par-
ity identities be given their most elementary significance, that
of equalities between sets of vectors having parity. This was
observed essentially by Glaisher,‡ although he did not state it
explicitly.


CALIFORNIA INSTITUTE OF TECHNOLOGY 


* The like applies to the broader class of linear homogeneous transforma- 
tions of the coordinates in the vector (or matric) variables of a parity function, 
although the parity of the transformed function, considered as a function of the 
new variables, is in general different from that of the function in the old vari- 
ables. All such transformations amount merely to linear homogeneous trans- 
formations of the indeterminates in the trigonometric identity which is equiva- 
lent to the parity identity. 

† Mathematische Annalen, vol. 52 (1899), pp. 561-582.

‡ Proceedings of the London Mathematical Society, vol. 22 (1890-91), pp.
359-410. In Dickson’s History, this work is reported in vol. 1, p. 310, under
Sum and Number of Divisors. It might have been cited also in vol. 2, Chap. 11.


1932.] THE MOMENT PROBLEM 269 


ON THE MOMENT PROBLEM FOR A 
FINITE INTERVAL 


BY T. H. HILDEBRANDT 


Hausdorff* and Schoenberg† have shown that a necessary and
sufficient condition that there exist a function of bounded varia-
tion x(t), such that we have, for every n, $\mu_n = \int_0^1 t^n\,dx(t)$, is that
$\sum_{m=0}^{n} \binom{n}{m} |\Delta^{n-m}\mu_m|$ form a bounded sequence. The fact that if
such a function x(t) exists, then $\Delta^{n-m}\mu_m = \int_0^1 t^m (1-t)^{n-m}\,dx(t)$, suggests
that perhaps the sufficiency part of this theorem can be easily
demonstrated by using the Bernstein polynomials and the Riesz
theorem on linear functional operations on the set of continuous
functions.
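
The boundedness condition can be illustrated numerically. The sketch below assumes the difference convention Δ^k μ_m = Σ_i (−1)^i C(k, i) μ_{m+i}, which is the one that makes Δ^{n−m} μ_m the integral of t^m (1−t)^{n−m}; the two moment sequences (dx = dt, and a unit jump at t = 1/2) are illustrative examples, not taken from the paper.

```python
from fractions import Fraction
from math import comb

def forward_difference(mu, k, m):
    """Delta^k mu_m = sum_{i=0}^{k} (-1)^i C(k, i) mu(m + i)  (assumed convention)."""
    return sum((-1) ** i * comb(k, i) * mu(m + i) for i in range(k + 1))

def hausdorff_sum(mu, n):
    """sum_{m=0}^{n} C(n, m) |Delta^{n-m} mu_m|, the quantity required to stay bounded."""
    return sum(comb(n, m) * abs(forward_difference(mu, n - m, m))
               for m in range(n + 1))

def lebesgue_moment(n):
    """Moments of dx(t) = dt on (0, 1): mu_n = 1/(n+1)."""
    return Fraction(1, n + 1)

def point_mass_moment(n):
    """Moments of a unit jump of x(t) at t = 1/2: mu_n = 2^(-n)."""
    return Fraction(1, 2 ** n)
```

For both sample sequences the sum equals 1 for every n, which is the total variation of the corresponding x(t); exact rational arithmetic is used so that the alternating sums do not lose precision.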

For any function f(t) on (0, 1) the nth Bernstein polynomial
is defined by the formula

$B_n(f) = \sum_{m=0}^{n} \binom{n}{m} f\!\left(\frac{m}{n}\right) t^m (1-t)^{n-m},$

and it is well known that the polynomials B_n(f) converge uni-
formly to f(t) if f is continuous. Further, it can be easily shown
by induction‡ that if P_k(t) is any polynomial of degree k, then

$B_n(P_k) = P_k(t) + \sum_{i=1}^{k-1} \frac{1}{n^i} A_{ki}(t),$

where A_{ki}(t) are polynomials of degree at most k and are inde-
pendent of n.
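
A short computation makes the last statement concrete in the simplest case. The sketch below evaluates the Bernstein polynomial exactly and compares B_n(t²) with t² + t(1−t)/n, the standard closed form for k = 2; the function name bernstein is an illustrative choice.

```python
from fractions import Fraction
from math import comb

def bernstein(f, n, t):
    """Value at t of the nth Bernstein polynomial of f on (0, 1)."""
    return sum(comb(n, m) * f(Fraction(m, n)) * t ** m * (1 - t) ** (n - m)
               for m in range(n + 1))

def square(s):
    return s * s

def correction_term_matches(n, t):
    """Check B_n(t^2) = t^2 + t(1 - t)/n, i.e. a single correction term in 1/n."""
    return bernstein(square, n, t) == t * t + t * (1 - t) / n
```

Here the correction polynomial t(1−t) does not depend on n, in agreement with the statement that the coefficients of the powers of 1/n are fixed polynomials.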
Define now a linear operation U on polynomials P_k(t) by the
condition

$U(P_k) = U(a_0 + a_1 t + \cdots + a_k t^k) = \sum_{i=0}^{k} a_i \mu_i.$


* Mathematische Zeitschrift, vol. 16 (1923), p. 238. 

† This Bulletin, vol. 38 (1932), p. 72.

‡ Following the suggestion in Pólya and Szegő, Aufgaben und Lehrsätze
aus der Analysis, vol. I, p. 6, Ex. 40; and p. 66, Ex. 144.




270 T. H. HILDEBRANDT [April, 


Then for any function f,

$U(B_n(f)) = \sum_{m=0}^{n} \binom{n}{m} f\!\left(\frac{m}{n}\right) \Delta^{n-m}\mu_m.$
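
The formula for U(B_n(f)) can be tried on a known moment sequence. In the sketch below the moments μ_n = 1/(n+1) of dx = dt and the test function f(t) = t² are illustrative assumptions, as is the difference convention Δ^k μ_m = Σ_i (−1)^i C(k, i) μ_{m+i}; for this choice the value works out to 1/3 + 1/(6n), which tends to the integral of t² over (0, 1).

```python
from fractions import Fraction
from math import comb

def forward_difference(mu, k, m):
    """Delta^k mu_m = sum_{i=0}^{k} (-1)^i C(k, i) mu(m + i)  (assumed convention)."""
    return sum((-1) ** i * comb(k, i) * mu(m + i) for i in range(k + 1))

def U_of_bernstein(f, mu, n):
    """U(B_n(f)) = sum_m C(n, m) f(m/n) Delta^{n-m} mu_m."""
    return sum(comb(n, m) * f(Fraction(m, n)) * forward_difference(mu, n - m, m)
               for m in range(n + 1))

def lebesgue_moment(n):
    """Moments of dx(t) = dt on (0, 1)."""
    return Fraction(1, n + 1)

def square(s):
    return s * s
```

The values stay below F·M = 1·1 in absolute value, in line with the bound derived next in the text.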


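As a consequence, for every n,
$|U(B_n(f))| \le F \cdot M,$


where F is the least upper bound of |f(t)| on (0, 1) and M is the
least upper bound of $\sum_{m=0}^{n} \binom{n}{m} |\Delta^{n-m}\mu_m|$. Hence if P_k is any
polynomial of degree k, then

$|U(P_k)| \le |U(B_n(P_k))| + \Bigl| \sum_{i=1}^{k-1} \frac{1}{n^i}\, U(A_{ki}) \Bigr|,$

so that
$|U(P_k)| \le \max |P_k| \cdot M.$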


It follows from this inequality that if f is any continuous
function on (0, 1), then the sequence of numbers U(B_n(f)) sat-
isfies the Cauchy condition of convergence, that is, converges to a
finite limit, which limit we denote by U(f). Obviously U(f) is
linear in f and |U(f)| ≤ F·M. Hence by the Riesz theorem on
linear limited operations, there exists a function x(t) of bounded
variation, such that

$U(f) = \int_0^1 f(t)\,dx(t)$, that is, $\mu_n = \int_0^1 t^n\,dx(t).$


THE UNIVERSITY OF MICHIGAN 




1932.] NEW DIOPHANTINE AUTOMORPHISMS 271 


NEW DIOPHANTINE AUTOMORPHISMS* 


BY LEONARD CARLITZ†


1. Introduction. In this note we consider the construction of 
an infinite set of homogeneous polynomials f such that 


where 


are homogeneous of degree h. In that case, f is said to admit of a
diophantine automorphism.‡ The construction depends on a
very simple principle connected with invariant theory.

Let φ be a binary form of degree δ:


3 6 8-1 3 


let q be a quadratic covariant of φ of degree ρ; then ψ, the dis-
criminant of q, is an invariant of the ground-form φ of degree 2ρ.
By the property of invariance, if under the linear transforma-
tion


(3) Xy = + = 1X1 + 
of determinant A = ay;022 — the form becomes 
= Ax = free, 
and becomes = (A), then 
(4) ¥(A) = A’*Y(a). 
Now putting g=qux? , we take in (3) 
11 = G12, O12 = O21 = — Qu, O22 = — 


Evidently (4) becomes 


* Presented to the Society, April 11, 1931.
† National Research Fellow.
‡ E. T. Bell, this Bulletin, vol. 33 (1927), pp. 71-80.


272 LEONARD CARLITZ [April,


(5) $\psi(A) = \psi^{\delta\rho+1}(a),$
where, by virtue of the specialized transformation (3),
(6) $A_i = A_i(a_0, a_1, \cdots, a_\delta)$


are homogeneous of degree δρ + 1.
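
The mechanism behind (4) and (5) can be seen in the smallest case, a binary quadratic form q = q₁₁x₁² + 2q₁₂x₁x₂ + q₂₂x₂² taken as its own ground-form (δ = 2, ρ = 1). The sketch below is an illustration only: the discriminant is normalized as ψ = q₁₁q₂₂ − q₁₂², and the entries of the specialized matrix (q₁₂, q₂₂; −q₁₁, −q₁₂) are an assumed reading of the display following (4), chosen so that its determinant equals ψ.

```python
def transform(q, alpha):
    """Coefficients of q after x1 = a11*X1 + a12*X2, x2 = a21*X1 + a22*X2,
    where q = (q11, q12, q22) stands for q11*x1^2 + 2*q12*x1*x2 + q22*x2^2."""
    q11, q12, q22 = q
    (a11, a12), (a21, a22) = alpha
    Q11 = q11 * a11 * a11 + 2 * q12 * a11 * a21 + q22 * a21 * a21
    Q12 = q11 * a11 * a12 + q12 * (a11 * a22 + a12 * a21) + q22 * a21 * a22
    Q22 = q11 * a12 * a12 + 2 * q12 * a12 * a22 + q22 * a22 * a22
    return (Q11, Q12, Q22)

def disc(q):
    """psi = q11*q22 - q12^2 (assumed normalization of the discriminant)."""
    return q[0] * q[2] - q[1] * q[1]

def det(alpha):
    (a11, a12), (a21, a22) = alpha
    return a11 * a22 - a12 * a21

def specialized(q):
    """Assumed specialized transformation built from the coefficients of q."""
    q11, q12, q22 = q
    return ((q12, q22), (-q11, -q12))
```

Under any substitution the discriminant is multiplied by the square of the determinant, and for the specialized substitution the determinant is ψ itself, so the new discriminant is ψ³ = ψ^{δρ+1}. In this quadratic case the transformed form is simply ψ·q, which is the trivial situation described in the text; the interest of the paper lies in forms where no such reduction occurs.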

Clearly (5) and (6) furnish a solution of (1) and (2). And (for
ψ ≠ square) we can assert immediately that it is surely non-
trivial if δ is odd. Indeed the solution is trivial only if (6) reduces
to


A;= 


2. Expression of q Symbolically. It is here convenient to make 
use of the symbolism of the theory of invariants.* Let us set 


¢= = + g = = + 
so that upon applying (3), in specialized form, we obtain 
+ g? Xe) + a2(— gPX1 — 
(a1g2 — + 2X2) = (ag)qx, 


a,—Ax 


(7) 


and therefore becomes 


(é—1) (é—1) 
= (ag)(az’)--- (aq )gxgx---qx 


where g, g’, - - - are equivalent symbols. Similarly 

qz—> + + gu? Xi — = (qq')qx’, 
and therefore g becomes 

(8) Q = = 


as follows immediately from the fundamental identity of in- 
variant theory. 


3. Consequences of (7) and (8). The equations (7) and (8) im- 
ply an interesting result concerning the transformation (6). 
Clearly all the operations performed on ¢@ may be performed on 
®. By (8), the transformation (3), properly specialized, changes 
(7) to 


(aq) = = prt?) 


* See Grace and Young, Algebra of Invariants, 1903, Chap. 1. 




and 


— Set2) /2g. 


that is, Φ is transformed into φ multiplied by a factor free of x.
This is equivalent to


(9) A,(A) = 


where the exponent is surely integral, for when δ is odd ρ must
be even. It follows that the equation (6) defines a Cremona trans-
formation of period two.

Now from (9) 


0A; 


0a; 


| 0A (A) 
aA; 


O( p> et2)/2q,) 
| 0a; 


since the determinants involved are jacobians. But the jacobian 
on the right is easily shown to be equal to 


(dp + 1) 
Hence, if W is algebraically irreducible, 


0A; 
0a; 


= 


where c denotes a numerical constant. 


4. Special Values of δ. We now consider some special values
of δ. If δ be odd, it is evident, to begin with, that a q is always
furnished by the covariant


$K_{\delta-1} = (\phi, \phi)^{\delta-1},$


the (δ−1)th transvectant of φ with itself. The quantity K_{δ−1} is
an irreducible covariant, and its discriminant

$\psi = \tfrac{1}{2}(K_{\delta-1}, K_{\delta-1})^2$

is an irreducible invariant of degree four. By what precedes, the
form ψ admits of a non-trivial automorphism.

Take δ = 3. Then K₂ is the Hessian of φ, and ψ is the discrim-
inant of φ as well as of K₂. By (7) we must here consider the
transformed form



= + (aq’)*g2? — (q'q")*a2) 
= (aq)(aq’)*gz — ¥-(ag)a2qz. 


But the first term involves ((¢, q)?, g)! and (¢, q)? vanishes, as 
can be seen directly or by making use of the known result that 
the cubic has no linear covariants. Hence the transformed form 
is, but for a numerical factor, the product of the discriminant 
into the cubicovariant. In this case, then, (6) reduces to 


1 
= Silo, a3), 


where f; is of degree three. 


5. The Case δ = 5. When δ = 5, there are three quadratic co-
variants in the irreducible set.* The simplest is the form K₄
= (φ, φ)⁴, which is generally denoted by i. For the transformed
form (7) we have here (i = i_x² = i′_x² = ···)


(10) (ai)(ai’) -- - --- 
= }(ai)i,(2(ai’)?-i — Aa?)(2(ai”)?-i — AaZ) 
= + AF, 


where A = (i, 7)? is used in place of y, and F is an (integral) co- 
variant whose exact form is immaterial. Now 


(ai) = (((¢, i)! 
Gj, i)! (a, i)! = — £, 


where j, a and @ are irreducible covariants of order 3, 1, 1, re- 
spectively.j It is then clear that (10) is not a multiple of A and 
therefore in this case the transformation (6) is not reducible; 
that is, it is actually a Cremona transformation of degree 2-5+1 
=11. 

We have defined 7 above as —(@, 7)?; the second transvec- 
tant of j with itself is a second irreducible quadri-covariant, 
tT =(j,j)*. We now have, in place of (10), 


* See Grace and Young, loc. cit., p. 131. 
+ See Grace and Young, loc. cit. 


(11) (ar) (ar’)r, 


— — Ca?) 
(ar) (ar’)?(ar’’)?7, + CF, 


where C =(r, 7)? replaces y. But 
(ar) = (((9, 7)*, 


(¢, 7)? = — la— 3A;. 


It is easy to show, using the tables of transvectants in Grace and 
Young, that (ta, 7)? is expressible linearly in terms of Ba and 6 
(B is an invariant, a and 6 are linear covariants in the irreduci- 
ble set of concomitants of $); (Ba, r)! and (6, r) are expressible 
in terms of C8 and By, y a third linear covariant. It is then 
evident that (11) is not divisible by C, and therefore the trans- 
formation (6) is here also of maximum degree (6-5+1=31). 

The remaining quadratic covariant 9=(z, 7) may be treated 
in exactly the same way, and again it appears that (6) does not 
reduce to a transformation of lower degree. 


6. The Case δ = 7. When δ = 7, there are several quadratic co-
variants. We limit ourselves to the K₆ = (φ, φ)⁶ = q, ψ = (q, q)².
For this case, (7) becomes


Now (aq)(aq′)²(aq″)²(aq‴)² q_x = (φ, q⁴)⁷ and this is a member of
the irreducible set of concomitants of the septimic.* Accordingly
(12) is not divisible by ψ, and again (6) is of maximum degree
(2·7 + 1 = 15).

In general, if 6 be odd (=2+1) and greater than three, then 
for g= Ks-1, (7) reduces to 


(6, qh gt YF, 


and it seems likely that this is not divisible by ψ, so that (6)
would in this case always be of maximum degree = 2δ + 1.


7. Even Values of δ. When δ is even it is necessary to consider
covariants of somewhat higher degree. Thus for δ = 6 the sim-
plest q is†


* See F. von Gall, Mathematische Annalen, vol. 31 (1888), p. 320. 
† See Grace and Young, loc. cit., p. 156.




q = i)‘, i = @, 
Then for (7) we get as above 
+ YF, 
where y = (I, But 
(al)?(al’)*(al’")? = (@, 


which is a member of the irreducible set of concomitants distinct 
from y. 

8. Conclusion. The method outlined in §1 is by no means re-
stricted to binary forms φ. It is indeed obvious that starting
with any quadratic covariant of a form φ in any number of varia-
bles we may arrive at equations (5) and (6). The transformation
involved will certainly not be trivial if the degree of φ is not
divisible by the number of variables.*

An obvious instance of this is furnished by quadratic forms, 
and indeed it is sufficiently obvious that their discriminants have 
the automorphic property. Another simple though less obvious 
instance is furnished by the quadri-covariant of the quaternary
cubic (φ = a_x³ = b_x³ = ···):


(abcd)(abce)(adef)(bdef) c_x f_x.


By the remark made in the last paragraph, this will certainly not 
lead to a trivial transformation. 


CALIFORNIA INSTITUTE OF TECHNOLOGY 


* It is of course assumed that the discriminant of the quadri-covariant of
φ is not a kth power, k being the number of variables.


1932.] A DIOPHANTINE EQUATION 277


NOTE ON THE DIOPHANTINE EQUATION
ax² + by² + cz² + dt² = 0
BY L. J. MORDELL


Suppose that the constants a, b, c, d are integers none of
which is zero, and that
I. a, b, c, d have no squared factors;
II. no three of a, b, c, d have a common factor, that is, [a, b, c]
= 1, etc.
Then, if we suppose the restrictions I, II apply throughout
the paper, the equation


(1) ax² + by² + cz² + dt² = 0


has integer solutions not all zero, if, and only if,
III. a, b, c, d are not all of the same sign,
IV. every odd prime factor p_ab of [a, b] for which

(2) $\left(\frac{-cd}{p_{ab}}\right) = -1$

must also satisfy

(3) $\left(\frac{-ab/p_{ab}^2}{p_{ab}}\right) = 1,$

together with five corresponding conditions derived by permuting
the letters. The symbol is that of quadratic residuacity.

V. (1) Either abcd ≡ 2, 3, 5, 6, 7 (mod 8),

V. (2) or abcd ≡ 1 (mod 8), a + b + c + d ≡ 0 (mod 8);
these mean that we must exclude


a ≡ b (mod 8), c ≡ d (mod 8), a ≡ c (mod 4),


and the corresponding sets derived by permuting the letters;
V. (3) or two of a, b, c, d are even, say a, b, and we must have
V. (3.1) either ¼abcd ≡ 3, 5, 7 (mod 8),
V. (3.2) or ¼abcd ≡ 1 (mod 8), and ½a + ½b + c + d ≡ (c²d² − 1)/2
(mod 8); these mean that we must exclude the sets


i b (mod 8), ¢ = 3d (mod 8), 


2 


278 L. J. MORDELL [April, 


or 

1 i 1 

c = d (mod 8), > = (mod 8), 
or 

1 5 1 

¢ = (mod 8), ¢ (mod 8). 


If in equation (1), we impose the restrictions that
(4) [z, t, a, b] = 1 or 2


together with the five similar ones obtained by permuting the
letters, the only difference in the conditions is that IV must be
replaced by

VI. −cd is a quadratic residue of [a, b], written as −cd R [a, b],
and so of every odd prime factor of [a, b]; together with the five
conditions obtained by permuting the letters.

In a recent number of this Bulletin, R. G. Archibald* deduces 
the conditions III, IV, V in a slightly different form (it is not 
difficult to see that his II is equivalent to my IV) as a particular 
case of an important general result due to Hasse. He also refers, 
inter alia, to my recent paper† where I show that III, V, VI are
the necessary and sufficient conditions for integer solutions of 
(1) subject to the restrictions typified by (2). 

The first results on (1) were due to Meyer who assumes not 
only that a, b, c, d satisfy the conditions I, II, which it may be 
remarked involve no loss of generality and simplify the results, 
but also the further condition that no primes p_ab exist for which
(2) holds, that is, his results applied to (1) limited only by I, II 
really give the solutions as restricted by my condition (4). Both 
his results and mine are really general in that it is very easy to 
transform a given equation into the required form. But the 
methods employed in my paper are so elementary, depending 
only on Legendre’s criteria for the solvability of the equation 


* Criteria for the solution of a certain quadratic diophantine equation, this 
Bulletin, vol. 37 (1931), pp. 608-614. 

† The condition for integer solutions of ax² + by² + cz² + dt² = 0, Journal für
Mathematik, vol. 164 (1931), pp. 40-49. References to the literature are given
here and also in Archibald’s note.




Ax² + By² + Cz² = 0,


that it may be desirable to remove the restriction (4) and to 
deduce the condition IV instead of VI. 

Let p_ab, p′_ab, etc. be the odd prime factors of [a, b]. Then (4)
is equivalent to


[z, t] ≢ 0 (mod p_ab or p′_ab, etc.),


that is, the 2 in (4) is really no restriction since c, d have no
squared factors.

We now find the condition that (1) should be solvable subject
to the six restrictions typified by (4) except that for a particular
odd factor q of [a, b], we are not excluding [z, t] ≡ 0 (mod q).
If we set


x = x′, y = y′, z = qz′, t = qt′,
the equation (1) becomes
(5) a′x′² + b′y′² + c′z′² + d′t′² = 0,
where a′ = a/q, b′ = b/q, c′ = cq, d′ = dq. Since a′, b′, c′, d′ still
satisfy I, II, we may assume that [x′, y′, z′, t′] = 1 and so no
three of x′, y′, z′, t′ can have a common factor. Hence x′, y′
are both prime to q, that is,


[x′, y′, c′, d′] = 1 or 2.
Hence it is clear that (5) is subject to the appropriate six re-
strictions typified by (4) and we can apply V and VI. But the
corresponding condition V for (5) is exactly the same as V since
a′b′c′d′ = abcd and


a′ + b′ + c′ + d′ = a/q + b/q + cq + dq ≡ 0 (mod 8),


since q² ≡ 1 (mod 8).
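
The effect of the substitution z = qz′, t = qt′ is easy to follow on numbers. The sketch below assumes the reading a′ = a/q, b′ = b/q, c′ = cq, d′ = dq; the helper names are illustrative, and the sample coefficients are those of Archibald's example discussed at the end of the note, with q = 11.

```python
def rescale(coeffs, q):
    """Coefficients (a', b', c', d') of (5) for an odd common factor q of a and b."""
    a, b, c, d = coeffs
    if a % q or b % q:
        raise ValueError("q must divide both a and b")
    return (a // q, b // q, c * q, d * q)

def value(coeffs, solution):
    """Value of the diagonal form a*x^2 + b*y^2 + c*z^2 + d*t^2."""
    return sum(k * s * s for k, s in zip(coeffs, solution))

def product(coeffs):
    a, b, c, d = coeffs
    return a * b * c * d

coeffs = (110, 770, -1, -1)
q = 11
primed = rescale(coeffs, q)            # coefficients of equation (5)
solution = (2, 1, 11, 33)              # x, y, z, t for equation (1)
primed_solution = (2, 1, 11 // q, 33 // q)   # x', y', z', t'
```

Because a/q − aq = (a/q)(1 − q²) is divisible by 8 for odd q, the sum a′ + b′ + c′ + d′ is congruent to q(a + b + c + d) modulo 8, which is why the parity conditions in V are unaffected by the rescaling.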

Since c′d′ = cdq², a′b′ = ab/q², a′c′ = ac, etc., the correspond-
ing condition VI, on noting that −(ab/q²) R q[c, d] means that
−(ab/q²) R q and −(ab/q²) R [c, d], becomes


(6) −cd R [a, b]/q, −(ab/q²) R q,


together with the five conditions 


(7) −ab R [c, d], −ac R [b, d], −ad R [b, c],
−bc R [a, d], −bd R [a, c].

If [a, b] is divisible by a prime p_ab which is not a factor of q
and for which

(8) $\left(\frac{-cd}{p_{ab}}\right) = -1,$

(6) will not hold. Hence (6) can be satisfied only if q is divisible
by every prime p_ab for which (2) holds. For these primes, the
condition −(ab/q²) R q becomes

(9) $\left(\frac{-ab/p_{ab}^2}{p_{ab}}\right) = 1.$

If there are such primes p_ab, VI shows that (1) is not solvable
unless [z, t] ≡ 0 (mod p_ab), and that it is solvable if q is the prod-
uct of these p_ab. Hence the equation (1) has solutions subject
to the five restrictions

(10) [x, y, c, d] = 1, 2; [x, z, b, d] = 1, 2; [x, t, b, c] = 1, 2;
[y, z, a, d] = 1, 2; [y, t, a, c] = 1, 2;

if and only if III and V hold and if every odd prime factor p_ab
of [a, b] for which (2) holds also satisfies (3), together with the
five conditions (7).

It is clear that we can proceed now with equation (1) as re-
stricted by (10) and remove in turn the restrictions [x, y, c, d]
= 1, 2, etc. Thus if we now allow


[x, y, c, d] = r > 2,


and put

x = rx″, y = ry″, z = z″, t = t″,
a″ = ar, b″ = br, c″ = c/r, d″ = d/r,

then

a″x″² + b″y″² + c″z″² + d″t″² = 0.

If this has solutions subject to the five restrictions typified by
(10), that is [x″, y″, c″, d″] = 1, 2, etc., the first condition in (7)
becomes −abr² R (c/r, d/r) while the other four remain un-
changed. Hence r must include among its factors the odd prime
factors p_cd of [c, d], if any, for which

$\left(\frac{-ab}{p_{cd}}\right) = -1.$


To the condition below (10), must be added the one arising since 
[a’’, b’’|=r|a, b], that if 


(11) ( — 1 


then also 
72 
— abr?/p, 
Pca 
that is, 


— ab 
(13) ( Pla ) 


But then there is no need to take p.4 as a factor of r, and we 
must have 


42 
(14) 
Pea 

Hence (1) has solutions subject to the last four restrictions 
in (10) if and only if III and V hold together with the last four 
conditions in (7), and also that (3) holds when (2) holds; and 
(14) holds when (13) holds. We thus arrive at the condition IV 
when (1) is not restricted by the six conditions typified by (4). 

It may be remarked that condition IV could be deduced more
simply by noting the fact pointed out in my paper that my
method showed that necessary and sufficient conditions for the
solvability in integers not all zero of f(x, y, z, t) = 0, where f is
an indefinite quaternary (or for that matter a ternary) quadratic
form with integer coefficients and non-zero determinant, is that
the congruence f(x, y, z, t) ≡ 0 (mod pⁿ) should be solvable with
[x, y, z, t] ≢ 0 mod p, for all primes p ≥ 2 and integers n > 0, and
that this requires a condition for only a finite number of values



of p, n. Thus for (1) we need only take p"=2', p3;, p2.,- ++. The 
condition V arises from p" = 2‘, and the condition IV on taking 
p"=p., and applying the result that the congruence 


Ax? + By? + Cz? = 0 (mod p*) 


is possible for all odd primes p if ABC ≢ 0 (mod p).
Archibald’s numerical example is


110x² + 770y² − z² − t² = 0.
Here only the primes p_ab exist and p_ab = 5, 11, since


[110, 770] ≡ 0 (mod p_ab).


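For p_ab = 11,

$\left(\frac{-cd}{p_{ab}}\right) = -1,$

but

$\left(\frac{-ab/p_{ab}^2}{p_{ab}}\right) = 1.$

For p_ab = 5,

$\left(\frac{-cd}{p_{ab}}\right) = 1,$

and no other condition arises. Hence IV is satisfied. Clearly V is
satisfied since
½a ≡ −1 (mod 8), ½b ≡ 1 (mod 8), c ≡ d (mod 8).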


Hence the equation is solvable. Archibald gives the solution
x = 2, y = 1, z = 11, t = 33.
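
The example can be checked mechanically. The sketch below assumes condition IV in the form "(−cd | p) = −1 implies (−ab/p² | p) = 1" for each odd prime p dividing [a, b], and likewise for the other five pairs of coefficients; the Legendre symbol is computed by Euler's criterion, and all function names are illustrative.

```python
from itertools import combinations
from math import gcd

def legendre(a, p):
    """Legendre symbol (a | p) for an odd prime p, by Euler's criterion."""
    r = pow(a % p, (p - 1) // 2, p)
    return -1 if r == p - 1 else r

def odd_prime_factors(n):
    """Distinct odd primes dividing the nonzero integer n."""
    n = abs(n)
    while n % 2 == 0:
        n //= 2
    factors, p = [], 3
    while p * p <= n:
        if n % p == 0:
            factors.append(p)
            while n % p == 0:
                n //= p
        p += 2
    if n > 1:
        factors.append(n)
    return factors

def condition_IV(coeffs):
    """Assumed form of IV, applied to all six pairs of coefficients."""
    for i, j in combinations(range(4), 2):
        k, l = [m for m in range(4) if m not in (i, j)]
        for p in odd_prime_factors(gcd(coeffs[i], coeffs[j])):
            if legendre(-coeffs[k] * coeffs[l], p) == -1:
                # p*p divides coeffs[i]*coeffs[j], so the division is exact
                if legendre(-(coeffs[i] * coeffs[j]) // (p * p), p) != 1:
                    return False
    return True

a, b, c, d = 110, 770, -1, -1
x, y, z, t = 2, 1, 11, 33
residue_V = (a * b * c * d // 4) % 8   # the quantity tested in V (3.1)
```

For these coefficients the form vanishes at (2, 1, 11, 33), the symbol (−cd | 11) is −1 while (−ab/121 | 11) is 1, the symbol (−cd | 5) is 1, and ¼abcd leaves remainder 7 modulo 8, so that V (3.1) applies.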


THE UNIVERSITY, MANCHESTER, ENGLAND 




1932.] METRIC GEOMETRY AND DETERMINANTS 283 


NOTE ON AN APPLICATION OF METRIC 
GEOMETRY TO DETERMINANTS 


BY L. M. BLUMENTHAL 


1. Introduction. This note refers to a previous paper* pub- 
lished in this Bulletin. On page 754 of that paper, the following 
theorem is stated. 


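THEOREM. If the symmetric determinant

$D = \begin{vmatrix} 0 & 1 & 1 & 1 & 1 \\ 1 & 0 & r_{12} & r_{13} & r_{14} \\ 1 & r_{21} & 0 & r_{23} & r_{24} \\ 1 & r_{31} & r_{32} & 0 & r_{34} \\ 1 & r_{41} & r_{42} & r_{43} & 0 \end{vmatrix}, \quad (r_{ij} = r_{ji}),$

with r_ij > 0, (i, j = 1, 2, 3, 4), i ≠ j, is different from zero, and the
complementary minors of four of the elements in the principal
diagonal vanish, then the complementary minor of the remaining
element does not vanish.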

In order to prove this theorem, two cases, A and B, were 
considered. In Case A it was supposed that all four of the bor- 
dered complementary minors vanished. It was then shown that 
the fifth complementary minor (the unbordered minor that is 
the complementary minor of the element appearing in the 
first row and first column) did not vanish. 

In Case B it was assumed that three of the four bordered
minors and the unbordered minor, denoted by E(p₁, p₂, p₃, p₄),
vanished. It was stated that Case B was exhausted by a study
of two sub-cases, both of which were examined and found to 
contradict the hypotheses stated for Case B. It was concluded, 
then, that this case could not exist, and from this conclusion 
four interesting corollaries were stated. The principal interest 
of the paper seems to the writer to lie in these corollaries. 

A communication received from W. V. Parker called my at- 
tention to the fact that the hypotheses explicitly made upon 


* This Bulletin, vol. 37 (1931), pp. 752-758. 




284 L. M. BLUMENTHAL [April, 


the elements of the determinant D, while sufficient for the 
validity of the theorem, are not sufficient to prevent the possi- 
bility of Case B occurring, and hence, the corollaries, dependent 
upon the impossibility of Case B for their correctness, are not 
true without an additional hypothesis upon D. This Parker 
showed by exhibiting a determinant which actually satisfied all 
the requirements stated, and which came under Case B. 

In the present paper we give a brief proof of the theorem with 
the conditions as stated in the original paper, in order to clear 
up any doubts that might exist about the validity of the theorem 
itself. Further, we make explicit the additional hypothesis upon 
the elements of the determinant D (implicit in the previous 
paper) and show in some detail that with this hypothesis the two 
cases examined under Case B are indeed the only two types that 
cannot be immediately rejected. These having been proved im- 
possible, the corollaries are seen to be valid. Finally, the last 
part of this note is concerned with obtaining the only class of 
determinants, of which the one due to Parker is a special case, 
for which the Case B does exist. It remains for me to express my 
thanks to him for his interest and his contribution. 


2. Proof of the Theorem. To prove the theorem itself, we con-
sider again the two cases A and B. The elements r_ij having been
replaced by (ij)², where (ij) is a positive number which may be
thought of as representing the distance between a point p_i and a
point p_j of a quadruple p₁, p₂, p₃, p₄, the assumption in Case A
that all four bordered complementary minors vanish means,
as stated in the previous paper, that all four of the triples of
points contained in the four points p₁, p₂, p₃, p₄ are linear. The
determinant D being, by hypothesis, different from zero, the
four points are not linear, and hence they form a pseudo-linear
quadruple. (This is a result due to Karl Menger, explicit refer-
ence to which was given.) The determinant E(p₁, p₂, p₃, p₄)
formed for a pseudo-linear quadruple is shown to be different
from zero, and the theorem is proved for this case.

In Case B we suppose that the four complementary minors
assumed to vanish consist of three of the bordered minors and
the unbordered minor E(p₁, p₂, p₃, p₄). We must then show
that the remaining minor, a bordered one, does not vanish.
This is immediate, for suppose that this minor vanishes. Then




all four of the bordered minors vanish, and hence the four points
p₁, p₂, p₃, p₄ are either linear or pseudo-linear. But, since the
determinant E(p₁, p₂, p₃, p₄) is assumed to vanish, the points are
not pseudo-linear, for, from Case A we have seen that the deter-
minant E(p₁, p₂, p₃, p₄) formed for a pseudo-linear quadruple
does not vanish. Therefore, the four points must be linear. But
this is impossible, since the determinant D vanishes for a linear
set of points, and, by hypothesis, the determinant D does not
vanish. Hence, the assumption that the remaining minor van-
ishes is seen to lead to a contradiction, and the theorem is proved.


3. The Triangular Inequality. We now state the hypothesis
that is sufficient to prevent the existence of Case B. This hypothesis
is that the elements $r_{ij} = (ij)^2$ of $D$ are such that each of
the four triples $p_1, p_2, p_3$; $p_1, p_2, p_4$; $p_1, p_3, p_4$; $p_2, p_3, p_4$ contained in
the four points $p_1, p_2, p_3, p_4$ satisfies the triangular inequality.*
This geometric way of stating the condition is made use of because
of its brevity; the condition may, of course, be stated in
algebraic language.

It is now to be shown that with this condition adjoined to 
those made explicitly in the theorem, the Case B cannot exist. 
Since the theorem itself has been established, we know that in 
the Case B, the remaining complementary minor does not van- 
ish. This is a bordered minor, and its non-vanishing means that 
the triple for which it is formed is not linear. We may assume
the labelling so that this non-linear triple is $p_1, p_2, p_4$. The other
three triples being linear (since their minors vanish) we have


(123)(231)(312) = 0, (234)(342)(423) = 0, (341)(413)(134) = 0, 


where by the symbol $ijk$ we mean $ij + jk - ki$, and the parentheses
indicate multiplication.

There are, then, 27 possible cases to examine, corresponding 
to the different ways in which the three bordered minors can 
vanish. An examination of these 27 cases yields the following 
results: 

Twelve of the cases result in the fourth triple $p_1, p_2, p_4$ being
linear, and hence are to be rejected, since we have seen that this
triple is not linear, since the determinant formed for it does not
vanish.

* Three points are said to satisfy the triangular inequality if their distances
are such that the sum of any two is greater than or equal to the third.

Nine of the cases result in the fourth triple $p_1, p_2, p_4$ not satisfying
the triangular inequality, and are therefore to be rejected
since we have supposed that all of the triples satisfy this inequality.

Two of the combinations are to be rejected since they imply 
that the sum of three distances equals zero. 

This leaves four cases to be examined. These four cases are: 


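\[
\text{(a)}\ \begin{cases} 132 = 0,\\ 134 = 0,\\ 234 = 0, \end{cases}\quad
\text{(b)}\ \begin{cases} 123 = 0,\\ 234 = 0,\\ 341 = 0, \end{cases}\quad
\text{(c)}\ \begin{cases} 312 = 0,\\ 342 = 0,\\ 134 = 0, \end{cases}\quad
\text{(d)}\ \begin{cases} 231 = 0,\\ 423 = 0,\\ 413 = 0. \end{cases}
\]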


Case (a) is seen to reduce to sub-case alpha of Case B if we 
interchange the labels of p; and p;. Hence, it has been shown to 
be impossible. Case (b) is precisely the case treated in sub-case 
beta of Case B, and there shown to be impossible. Case (c) is 
merely another labelling of case (b) in which $p_1$ and $p_2$ are interchanged.
Case (d) is reduced to case (c) if the letters $p_2$ and $p_4$
are interchanged. Hence these four cases are also impossible and
the Case B cannot exist. Therefore, with the additional assump- 
tion stated in this section, the corollaries of the theorem are 
valid. 

4. Possibility of Case B. We now obtain a class of determinants 
that may occur under Case B when we drop the assumption that 
each of the triples satisfies the triangular inequality. 

Consider the relations $312 = 0$; $234 = 0$; $413 = 0$. If we put
$12 = a$, $14 = c$, $24 = b$, we obtain, by substitution in the above
three equations, $13 = \tfrac12(-a + b - c)$, $23 = \tfrac12(a + b - c)$,
$34 = \tfrac12(-a + b + c)$. Since the distance $(ij)$ is always positive,
we have the positive numbers $a, b, c$ satisfying the inequalities
$a + b > c$, $b + c > a$, $b > a + c$. It will be observed that the
triple $p_1, p_2, p_4$ does not satisfy the triangular inequality. Now,
by hypothesis, the determinant $E(p_1, p_2, p_3, p_4)$ vanishes. Substituting
the values obtained above, and developing the determinant,
we obtain


\[
E(p_1, p_2, p_3, p_4) = \tfrac{1}{16}(a + b + c)^2 (a + b - c)
\cdot (a - b + c)(-a + b + c)^2 (2ab + 2ac + 2bc - a^2 - b^2 - c^2).
\]

Thus, it is seen, $E(p_1, p_2, p_3, p_4)$ is zero if and only if

\[
a^2 + b^2 + c^2 - 2(ab + ac + bc) = 0,
\]

since the first four parentheses are seen not to vanish by virtue
of the inequalities that $a, b, c$ satisfy.

Let us calculate the value of the determinant D formed for 
this case. It is found that 


\[
D(p_1, p_2, p_3, p_4)
= -\tfrac{1}{8}\,(-a^3 - 2b^3 - 2c^3 + 3ab^2 + 3ac^2 + 2bc^2 + 2b^2c - 6abc)^2,
\]

which may be written in the form

\[
D = -\tfrac{1}{8}\,\bigl[-(a^2 + b^2 + c^2 - 2ab - 2ac - 2bc)(a + 2b + 2c) - 16abc\bigr]^2,
\]

and, on account of the condition obtained by setting $E$ equal to
zero, this becomes $D = -32a^2b^2c^2$.
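
The value just obtained can be checked numerically. The following is a minimal sketch in Python, not part of the original note. It assumes that $D$ is the determinant $|(ij)^2|$ bordered by a row and column of ones with a zero corner, and that $E$ is the corresponding unbordered determinant. The sample values $a = 1$, $b = 9$, $c = 4$ are hypothetical; they satisfy $a^2 + b^2 + c^2 - 2(ab + ac + bc) = 0$ and $b > a + c$. The helper names `det`, `distances`, and `e_and_d` are illustrative only.

```python
from fractions import Fraction

def det(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    m = [[Fraction(x) for x in row] for row in matrix]
    n, sign, result = len(m), 1, Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)          # a zero column: singular matrix
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            sign = -sign                # a row swap flips the sign
        result *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return sign * result

def distances(a, b, c):
    """Distances (ij) from 12=a, 14=c, 24=b and 312=0, 234=0, 413=0."""
    half = Fraction(1, 2)
    return {(1, 2): a, (1, 4): c, (2, 4): b,
            (1, 3): half * (-a + b - c),
            (2, 3): half * (a + b - c),
            (3, 4): half * (-a + b + c)}

def e_and_d(a, b, c):
    """Return (E, D): the unbordered and the bordered determinant."""
    d = distances(a, b, c)
    sq = lambda i, j: 0 if i == j else d[(min(i, j), max(i, j))] ** 2
    unbordered = [[sq(i, j) for j in range(1, 5)] for i in range(1, 5)]
    bordered = [[0, 1, 1, 1, 1]] + [[1] + row for row in unbordered]
    return det(unbordered), det(bordered)

a, b, c = 1, 9, 4                       # assumed sample values
E, D = e_and_d(a, b, c)
print("E =", E, " D =", D, " -32 a^2 b^2 c^2 =", -32 * a**2 * b**2 * c**2)
```

Under these assumptions the sketch lets one confirm, for a single numerical instance, that $E$ vanishes while $D$ takes the value $-32a^2b^2c^2$; it is a spot check, not a proof.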

In order to see that this constitutes the only class of determi- 
nants for which Case B exists, it is necessary and sufficient to 
examine each of the nine combinations referred to in §3 for 
which the fourth triple $p_1, p_2, p_4$ does not satisfy the triangular
inequality. These nine combinations are 


312 = 0, = 0, = 0, 

(1) 234 = 0, (2) a = 0, (3) ‘ie = 0, 
413 = 0, 134 = 0, 341 = 0, 

= 0, = 0, = 0, 

(4) ee = 0, (5) ae = 0, (6) ‘am = 0, 
134 = 0, 413 = 0, 134 = 0, 

a = 0, = 0, oe = 0, 

(7) ) 234 = 0, (8) ses = 0, (9) en = 0, 

341 = 0 413 = 0, 341 = 


The first combination is the one just treated, and shown to
exist. It is seen at once that the second combination may be
obtained from the first by interchanging the letters $p_1$ and $p_2$;
while the third is similarly obtained from the first by interchanging
the labels of $p_1$ and $p_4$. Hence these two combinations
are equivalent to the first and do not require further treatment.

It will be found upon examination that the last six combinations
are equivalent to the fourth combination, being obtainable
from this combination by suitable changes in the labelling of the
points. It will be sufficient, then, to examine combination (4).

We have, writing $12 = a$, $14 = c$, $24 = b$, and substituting in the
relations (4), that

\[
13 = \tfrac12(a + b + c), \qquad 23 = \tfrac12(-a + b + c), \qquad 34 = \tfrac12(-a - b + c),
\]

the positive numbers $a, b, c$ satisfying the inequality $c > a + b$.
The determinant $E(p_1, p_2, p_3, p_4)$, which vanishes, by hypothesis,
takes the form, upon being developed,

\[
E(p_1, p_2, p_3, p_4)
= \tfrac{1}{16}(a + b + c)^2(-a + b + c)^2(a + b - c)^2
\cdot (a^2 + b^2 + c^2 + 2ab + 2bc - 2ac).
\]

Now evidently none of the parentheses vanishes, and hence
$E(p_1, p_2, p_3, p_4)$ is not zero, which contradicts the hypothesis.
Hence, this case is not possible.

Thus we have seen that if we assume the elements $r_{ij} = (ij)^2$
of the determinant $D$ to be such that each triple of points satisfies
the triangular inequality, then only Case A is possible,
and the four corollaries, stated at the end of the paper about
which this note is written, are valid. If we do not make this assumption,
then Case B can exist for only one type of combination,
the one investigated in this paper. We notice that in either
event, the value of the determinant is given by $-32a^2b^2c^2$,
where $a, b, c$ are positive numbers satisfying the appropriate
conditions.


THE RICE INSTITUTE


1932.] POLYNOMIAL INTERPOLATION 289 


ON POLYNOMIAL INTERPOLATION TO ANALYTIC 
FUNCTIONS WITH SINGULARITIES* 


BY J. L. WALSH 


Méray has given† the following illustration to show that
polynomials formed from a given function by interpolation do
not necessarily converge to that function. Interpolate to the
function $f(z) = 1/z$ by means of the polynomials $p_n(z)$ of respective
degrees‡ $n = 1, 2, 3, \dots$, required to coincide with $f(z)$ in
the $(n+1)$th roots of unity; this condition defines the polynomials
$p_n(z)$ uniquely. Moreover, we have

\[
p_n(z) = z^n, \tag{1}
\]

because the equation $p_n(z) = 1/z$ is satisfied provided $z$ is one of
the $(n+1)$th roots of unity. Even though the sequence $p_n(z)$
is defined by interpolation from the function $f(z) = 1/z$, the
polynomials $p_n(z)$ do not approach the function $f(z)$ for $|z| < 1$,
as $n$ becomes infinite, but approach the limit zero. It is naturally
not surprising that these polynomials should fail to approach
$f(z)$ for $|z| < 1$, since $f(z)$ has a singularity in that region; this
sequence of polynomials fails to approach the limit $f(z)$ even in
a neighborhood of the curve $|z| = 1$ on which interpolation takes
place.
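
Méray's example is easy to reproduce numerically. The sketch below is in Python and is not from the original note. It assumes the standard fact that the coefficients of the polynomial interpolating in the $(n+1)$th roots of unity are given by an inverse discrete Fourier transform of the sampled values. The function names and the sample point `z0` are illustrative.

```python
import cmath

def interpolation_coefficients(f, n):
    """Coefficients c_0..c_n of the degree-n polynomial that agrees with f
    at the (n+1)th roots of unity (inverse discrete Fourier transform)."""
    size = n + 1
    roots = [cmath.exp(2j * cmath.pi * k / size) for k in range(size)]
    values = [f(w) for w in roots]
    return [sum(v * w ** (-j) for v, w in zip(values, roots)) / size
            for j in range(size)]

def evaluate(coeffs, z):
    """Horner evaluation of c_0 + c_1 z + ... + c_n z^n."""
    acc = 0j
    for c in reversed(coeffs):
        acc = acc * z + c
    return acc

n = 12
coeffs = interpolation_coefficients(lambda z: 1 / z, n)
z0 = 0.5 + 0.25j                      # assumed sample point with |z0| < 1
print("largest coefficient index:",
      max(range(n + 1), key=lambda j: abs(coeffs[j])))
print("|p_n(z0)| =", abs(evaluate(coeffs, z0)), " |1/z0| =", abs(1 / z0))
```

Up to rounding, only the coefficient of $z^n$ is different from zero, in agreement with (1), so that the computed value at an interior point is small while $1/z$ is not.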

In this connection, it is worth while to recall Runge's result§
that if $f(z)$ is analytic for $|z| \le 1$, then the sequence of interpolating
polynomials $p_n(z)$ of respective degrees $n$ which coincide with
$f(z)$ in the $(n+1)$th roots of unity converges to the limit $f(z)$
for $|z| \le 1$.


* Presented to the Society, March 25, 1932. 

† Annales de l'École Normale Supérieure, (3), vol. 1 (1884), pp. 165-176.
This illustration is also presented by Montel (after Méray), in his Séries de
Polynomes, 1910, p. 51.

‡ A polynomial of the form $a_0z^n + a_1z^{n-1} + \cdots + a_n$ is said to be of degree $n$.

§ Theorie und Praxis der Reihen, 1904, p. 137. This method of Runge's
has been more systematically developed by Fejér, Göttinger Nachrichten,
1918, pp. 319-331, and by L. Kalmár, Mathematikai és Physikai Lapok,
1926, pp. 120-149, but only for interpolation in points on the boundary of a
region of a function known to be analytic in that region.


290 J. L. WALSH {April, 


Méray’s illustration is of such simplicity, directness, and 
beauty, and apparently has stood alone for so long, that it 
seems worth while to furnish it with some companionship. The 
purpose of the present note is to provide such companionship by 
the proof of the following theorem. 


THEOREM 1. Let $f(z)$ be an arbitrary function continuous for
$|z| = 1$. Let the polynomials $p_n(z)$ of respective degrees $n$, $(n = 1,
2, \dots)$, be defined by the requirement of coinciding with $f(z)$ in the
$(n+1)$th roots of unity. Then the sequence $p_n(z)$ approaches the
limit

\[
f_1(z) = \frac{1}{2\pi i}\int_C \frac{f(t)\,dt}{t - z} \tag{2}
\]

for $|z| < 1$, uniformly for $|z| \le r < 1$.

Lagrange's interpolation formula for the polynomial $p_n(z)$
of degree $n$ which takes on the values $K_1, K_2, \dots, K_{n+1}$ at the
$n+1$ distinct points $z_1, z_2, \dots, z_{n+1}$ is

\[
p_n(z) = \sum_{k=1}^{n+1} \frac{K_k\, p(z)}{(z - z_k)\, p'(z_k)},
\]

where $p(z) = (z - z_1)(z - z_2)\cdots(z - z_{n+1})$; the polynomial $p_n(z)$
is uniquely determined by these requirements. Under the circumstances
of the theorem, we set $z_k = \omega^k$, where $\omega = e^{2\pi i/(n+1)}$,
$p(z) = z^{n+1} - 1$. It follows that we have

\[
p_n(z) = \sum_{k=1}^{n+1} \frac{f(\omega^k)\,\omega^k\,(z^{n+1} - 1)}{(n+1)(z - \omega^k)}. \tag{3}
\]

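With the exception of the term $z^{n+1}$ in the numerator, which
approaches zero, equation (3) suggests computation of the
integral which appears in (2) by division of the circle $C$ defined
by $|z| = 1$, at the points $\omega^k$. We have

\[
f_1(z) = \lim_{n\to\infty} \frac{1}{2\pi i} \sum_{k=1}^{n+1} \frac{f(\omega^k)(\omega^{k+1} - \omega^k)}{\omega^k - z}, \tag{4}
\]

\[
\lim_{n\to\infty} [f_1(z) - p_n(z)]
= \lim_{n\to\infty} \left[\frac{1}{2\pi i} - \frac{1 - z^{n+1}}{(n+1)(\omega - 1)}\right]
\sum_{k=1}^{n+1} \frac{(\omega^{k+1} - \omega^k) f(\omega^k)}{\omega^k - z}. \tag{5}
\]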


By (4), the summation on the right-hand side approaches the
limit $2\pi i f_1(z)$, which is continuous for $|z| < 1$, and the limit is
approached uniformly for $|z| \le r < 1$.* The quantity $(n+1)(\omega - 1)$
approaches as its limit $2\pi i$, for we have

\[
\omega = \cos\frac{2\pi}{n+1} + i \sin\frac{2\pi}{n+1},
\]

\[
\frac{(n+1)(\omega - 1)}{2\pi i}
= \frac{\cos\dfrac{2\pi}{n+1} - 1}{\dfrac{2\pi i}{n+1}} + \frac{\sin\dfrac{2\pi}{n+1}}{\dfrac{2\pi}{n+1}},
\]

which approaches the limit unity. The square bracket in the
right-hand member of (5) thus approaches zero for $|z| < 1$,
uniformly for $|z| \le r < 1$, and the factor of this bracket is
bounded uniformly in $z$ and $n$, $|z| \le r < 1$, so the proof of our
theorem is complete.
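
The statement of Theorem 1 can be illustrated by a short computation. The Python sketch below is not part of the original note. It evaluates $p_n(z)$ as the Lagrange sum over the $(n+1)$th roots of unity, namely $\sum_k f(\omega^k)\,\omega^k (z^{n+1}-1)/[(n+1)(z-\omega^k)]$, for the hypothetical test function $f(z) = e^z + 1/(z - \tfrac12)$. Since $e^z$ is analytic within $C$, while $1/(z - \tfrac12)$ is analytic outside $C$ and vanishes at infinity, the integral (2) gives $f_1(z) = e^z$ for this choice. The function name and sample point are illustrative.

```python
import cmath

def lagrange_roots_of_unity(f, n, z):
    """Evaluate the degree-n interpolant to f in the (n+1)th roots of
    unity at the point z, using the Lagrange sum over those roots."""
    size = n + 1
    total = 0j
    for k in range(1, size + 1):
        w = cmath.exp(2j * cmath.pi * k / size)
        total += f(w) * w * (z ** size - 1) / (size * (z - w))
    return total

# Assumed test function: e^z is analytic inside C; 1/(z - 1/2) is analytic
# outside C and vanishes at infinity, so the Cauchy integral is e^z.
f = lambda z: cmath.exp(z) + 1 / (z - 0.5)

z0 = 0.2 + 0.1j                        # assumed sample point with |z0| < 1
errors = {n: abs(lagrange_roots_of_unity(f, n, z0) - cmath.exp(z0))
          for n in (5, 20, 60)}
for n, err in errors.items():
    print(n, err)
```

The printed differences decrease with $n$, although $p_n(z_0)$ does not approach $f(z_0)$ itself; the pole of $f$ at $z = \tfrac12$ leaves no trace in the limit.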

The theorem and proof are obviously valid if the given function
$f(z)$ is not supposed continuous on $C$: $|z| = 1$, but merely
integrable in the sense of Riemann.

The sequence $p_n(z)$ clearly converges uniformly in a region
containing the curve $C$ in its interior when and only when the
given function $f(z)$ or its analytic extension is analytic on and
within $C$. The first part of this result was proved by Fejér (loc.
cit.), or compare Walsh.† Reciprocally, if the sequence $p_n(z)$
converges uniformly for $|z| \le \rho$, $\rho > 1$, we shall prove $f(z)$ analytic
on and within $C$. The limit of the sequence $p_n(z)$ is analytic for
$|z| < \rho$. The obvious equation $p_{m(n+1)-1}(\omega^k) = f(\omega^k)$, $(m = 1, 2, \dots)$,
where $\omega = e^{2\pi i/(n+1)}$, implies the convergence of the sequence
$p_n(z)$ to the function $f(z)$ at each point $\omega^k$, hence at a set of
points everywhere dense on $C$. The continuity of $f(z)$ for $|z| = 1$
and the analytic character of the limit of the sequence $p_n(z)$ for
$|z| < \rho$ then imply the identity on $C$ of those two functions,
hence the fact that $f(z)$ or its analytic extension is analytic for
$|z| < \rho$. If the sequence $p_n(z)$ converges uniformly for $|z| \le 1$,
the function $f(z)$ is analytic for $|z| < 1$, and continuous for
$|z| \le 1$.


* Compare Runge, Acta Mathematica, vol. 6 (1885), pp. 229-244; Montel,
loc. cit., p. 57; or Osgood, Funktionentheorie, 1928, pp. 579-581.
† Transactions of this Society, vol. 34 (1932), pp. 22-74, §11.


The following result is a not uninteresting complement to the 
main theorem already proved. 


THEOREM 2. The function $f_1(z)$ which is the limit for $|z| < 1$ of
the sequence $p_n(z)$ of the interpolating polynomials for $f(z)$ is also
the limit for $|z| < 1$, uniformly for $|z| \le r < 1$, of the sequence of
polynomials $P_n(z)$ of respective degrees $n = 1, 2, \dots$, of best
approximation to $f(z)$ on $C$ in the sense of least squares.


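The polynomial $P_n(z)$ is that polynomial of degree $n$ for which

\[
\int_C |f(z) - P_n(z)|^2\, |dz|
\]

is least. Such a polynomial is known to exist and be unique;
it is defined* by

\[
P_n(z) = c_0 + c_1 z + \cdots + c_n z^n, \qquad
c_n = \frac{1}{2\pi}\int_C f(z)\,\bar z^{\,n}\,|dz| = \frac{1}{2\pi i}\int_C \frac{f(z)\,dz}{z^{n+1}}.
\]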


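We have the relation

\[
f_1(z) = \frac{1}{2\pi i}\int_C \frac{f(t)\,dt}{t - z}
= \frac{1}{2\pi i}\int_C f(t)\left[\frac{1}{t} + \frac{z}{t^2} + \frac{z^2}{t^3} + \cdots\right] dt. \tag{6}
\]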


The infinite series converges uniformly in $z$ and $t$ for $t$ on $C$,
$|z| \le r < 1$, and hence may be integrated term by term. Thus we
have from (6) $f_1(z) = c_0 + c_1 z + c_2 z^2 + \cdots$, a series which is
uniformly convergent for $|z| \le r < 1$; this includes the relation to
be proved:

\[
f_1(z) = \lim_{n\to\infty} P_n(z), \qquad |z| < 1.
\]

The polynomial $P_n(z)$ is the sum of the first $n+1$ terms of the
Maclaurin development of $f_1(z)$.
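
The coefficients of the least-squares polynomials can likewise be approximated numerically. The Python sketch below is not from the original note. It assumes that the coefficients are the Fourier coefficients $c_k = (1/2\pi)\int_C f(z)\,\bar z^{\,k}\,|dz|$, approximates each integral by the trapezoidal rule at equally spaced points of $C$, and uses the same hypothetical test function $f(z) = e^z + 1/(z - \tfrac12)$, for which $f_1(z) = e^z$ and hence $c_k = 1/k!$. The function name and the number of sample points are illustrative.

```python
import cmath
from math import factorial

def least_squares_coefficients(f, n, samples=512):
    """Approximate c_k = (1/2 pi) * integral over C of f(z) conj(z)^k |dz|,
    k = 0..n, by the trapezoidal rule at equally spaced points of C."""
    points = [cmath.exp(2j * cmath.pi * j / samples) for j in range(samples)]
    values = [f(t) for t in points]
    return [sum(v * t.conjugate() ** k for v, t in zip(values, points)) / samples
            for k in range(n + 1)]

# Assumed test function, chosen so that f_1(z) = e^z and c_k = 1/k!.
f = lambda z: cmath.exp(z) + 1 / (z - 0.5)

coeffs = least_squares_coefficients(f, 6)
for k, c in enumerate(coeffs):
    print(k, abs(c - 1 / factorial(k)))
```

The computed coefficients agree with those of the Maclaurin development of $e^z$ to within rounding, so that the polynomials $P_n(z)$ are its partial sums, whatever the part of $f$ that is analytic outside $C$ may be.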

Several particular cases are worth mentioning. If there exists
a function analytic for $|z| < 1$, continuous for $|z| \le 1$, which
coincides with the given function $f(z)$ on $C$, then $f_1(z)$ naturally
coincides with this function. If there exists a function $F(z)$ analytic
for $|z| > 1$ (including the point $z = \infty$), continuous for $|z| \ge 1$,
which coincides with the given function $f(z)$ on $C$, then each
$c_n$ $(n > 0)$ vanishes, and we have

\[
\lim_{n\to\infty} p_n(z) = \lim_{n\to\infty} P_n(z) = c_0 = F(\infty), \qquad |z| < 1;
\]

here is included the case $f(z) = 1/z^k$, $k$ a positive integer ($k = 1$ is
Méray's example), and we have $f_1(z) = 0$. More generally, if
on $C$ the function $f(z)$ can be expressed as $f(z) = f_1(z) + f_2(z)$,
where $f_1(z)$ is analytic for $|z| < 1$, continuous for $|z| \le 1$, and
where $f_2(z)$ is analytic for $|z| > 1$ including the point at infinity,
is continuous for $|z| \ge 1$, and vanishes at infinity, then we have

\[
\lim_{n\to\infty} p_n(z) = \lim_{n\to\infty} P_n(z) = f_1(z), \qquad |z| < 1,
\]

uniformly for $|z| \le r < 1$. If this function $f_1(z)$ is analytic for
$|z| < \rho$, $\rho > 1$, then we have

\[
\lim_{n\to\infty} P_n(z) = f_1(z), \qquad |z| < \rho, \quad \text{uniformly for } |z| \le \rho' < \rho,
\]

no matter what $f_2(z)$ may be, but we have

\[
\lim_{n\to\infty} p_n(z) = f_1(z), \qquad |z| < \rho, \quad \text{uniformly for } |z| \le \rho' < \rho,
\]

if and only if $f_2(z)$ vanishes identically.

* See for instance Kowalewski, Determinantentheorie, 1909, §137.

Another illustration of the close connection between interpolation
in the $(n+1)$th roots of unity and approximation on $C$ in
the sense of least squares has recently been indicated by the
present writer (loc. cit.).


If the function $f(z)$ is analytic for $|z| < T$, $T > 1$, then for the
polynomials $p_n(z)$ and $P_n(z)$ already defined we have

\[
\lim_{n\to\infty} [p_n(z) - P_n(z)] = 0, \qquad |z| < T^2, \quad \text{uniformly for } |z| \le R < T^2,
\]

even though $f(z)$ has singularities for $T \le |z| < T^2$.


It is not to be supposed (compare Kalmár, loc. cit.) that interpolation
in points arbitrarily chosen on $C$ is always equivalent
in the sense illustrated to approximation on $C$ in the sense of
least squares, even when the points of interpolation $z_k^{(n)}$ for the
interpolating polynomial $p_n(z)$ of degree $n$ are such that the
maximum distance between successive points $z_k^{(n)}$ on $C$ for
a given $n$ approaches zero with $1/n$. We give an example to illustrate
this fact, where the $n+1$ points of interpolation for the
polynomial $p_n(z)$ are the roots of

\[
\left(\frac{1 - az}{a - z}\right)^{n+1} = 1, \qquad a > 1. \tag{7}
\]

These points of interpolation are thus the transforms in the $z$-plane
of the $(n+1)$th roots of unity in the $w$-plane, under the
transformation

\[
w = \frac{1 - az}{a - z}, \tag{8}
\]

which leaves $C$ invariant and transforms the interior and exterior
of $C$ respectively into the interior and exterior of $C$. It is simpler
to study the situation in the $w$-plane. We interpolate to the
function $f(z) = 1/(w + T)$, $0 < T < 1$, in the points $w$, $w^{n+1} = 1$,
by rational functions $p_n(z) = F_n(w)$ of respective degrees (in $z$
or $w$) $n$ whose poles coincide in the point $w = a$. The reader can
easily verify the formula


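\[
p_n(z) = F_n(w) = \frac{[(-1)^n + T^{n+1}](w - a)^n + (T + a)^n (w^{n+1} - 1)}{[(-1)^n + T^{n+1}](w + T)(w - a)^n}; \tag{9}
\]

the expression in terms of $z$ is found from (8). It appears from
(9) that one may write

\[
F_n(w) - \frac{1}{w + T} = \frac{(T + a)^n (w^{n+1} - 1)}{[(-1)^n + T^{n+1}](w + T)(w - a)^n},
\]

so that, for $|w| < 1$, we have actual convergence when and only
when the condition

\[
T + a < |w - a| \tag{10}
\]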


is satisfied. Condition (10) is equivalent to the condition that $w$
should lie exterior to a certain circle whose center is $a$ and which
cuts $C$: $|w| = 1$. Thus the sequence $p_n(z)$ converges in only a part
of the unit circle $|z| < 1$, and in that part converges to the original
function $1/(w + T)$.
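
The behavior described by condition (10) can be observed directly. The Python sketch below is not part of the original note. It takes the rational interpolant in the form $F_n(w) = \{[(-1)^n + T^{n+1}](w-a)^n + (T+a)^n(w^{n+1}-1)\} / \{[(-1)^n + T^{n+1}](w+T)(w-a)^n\}$, which is a reconstruction and should be read as an assumption, checks that it agrees with $1/(w+T)$ in the $(n+1)$th roots of unity, and then compares a point where (10) holds with one where it fails. The values $T = 0.5$, $a = 2$ and the two test points are hypothetical.

```python
import cmath

def F(n, w, T, a):
    """Assumed form of the rational interpolant to 1/(w + T) in the
    (n+1)th roots of unity, with all of its poles at w = a."""
    bracket = (-1) ** n + T ** (n + 1)
    numerator = bracket * (w - a) ** n + (T + a) ** n * (w ** (n + 1) - 1)
    return numerator / (bracket * (w + T) * (w - a) ** n)

T, a, n = 0.5, 2.0, 7                  # assumed sample parameters

# F_n agrees with 1/(w + T) at every (n+1)th root of unity.
for k in range(n + 1):
    w = cmath.exp(2j * cmath.pi * k / (n + 1))
    assert abs(F(n, w, T, a) - 1 / (w + T)) < 1e-9

# At w = -0.8 we have |w - a| = 2.8 > T + a = 2.5, so (10) holds;
# at w = 0.3 we have |w - a| = 1.7 < 2.5, so (10) fails.
errors_by_point = {}
for w in (-0.8, 0.3):
    errors_by_point[w] = [abs(F(m, w, T, a) - 1 / (w + T))
                          for m in (10, 40, 160)]
    print(w, errors_by_point[w])
```

For the first point the differences tend to zero as the degree increases; for the second they grow without bound, although both points lie within the unit circle.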

It would be of interest to extend the main theorem of this 
note to the study of curves other than the unit circle; compare 
the references already given to Fejér and Kalmar. 


HARVARD UNIVERSITY 




1932.] RATIONAL QUINTIC SURFACES 295 


ANALYTIC STUDY OF RATIONAL QUINTIC SURFACES 
HAVING NO MULTIPLE CURVES 


BY H. N. HUBBS 


1. Introduction. The purpose of this paper is to derive the equa- 
tions of certain of the rational quintic surfaces without multiple 
curves discussed synthetically by Montesano.* The equations 
of the surfaces are found by applying Cremona transformations 
to certain well known rational surfaces of order three or four. 


2. Surface of Order Five with Four Triple Points. This surface
is the transform by the cubic transformation $T_{3-3}$† of a general
cubic surface $\phi_3$ through the vertices of the tetrahedron. The
equation of the surface is


os = ye + + yeu’ + Ayoysys] 
+ + yoys(Bysys + you’”’)| 
+ yoysyalCysye + Dyoys + Eyzys] = 0, 


where u, u’, u’’, u’’’ are linear in (ys, ys), ¥4), (V2, 3), (y's, V4), 
respectively, and ¢2 is quadratic in (y2, y3),and where A, B, C, D 
and E are constants. The points whose coordinates are (1,0,0,0), 
(0,1,0,0), (0,0,1,0), (0,0,0,1) are triple points, with non-compos- 
ite tangent cones at each of the points. 


3. The Surface $\phi_5$ with Three Ordinary Triple Points and a
Tacnode. The surface $\phi_5$ is the transform by $T_{3-3}$ of a quartic
surface with a double conic passing through three of the vertices
of the tetrahedron and having the fourth vertex at a general
point of the surface. The section of $\phi_5$ by a plane through the
tacnode and two triple points is a straight line and a pair of
conics passing through these points.

The equation of $\phi_4$ with a double conic is


[ |? 4x2 [yo + xa, | = 0, 
(1,7,k R), 


* Montesano, Napoli Rendiconti, (3), vol. 7 (1901), pp. 67-106. 
† Hudson, Cremona Transformations in Plane and Space, Cambridge
University Press, 1927, pp. 301-303.


296 H. N. HUBBS [April, 


where y; and yz are linear and quadratic, respectively, in the 
variables (x1, x2, x3). The surface ws is 


os = — 4yahs — = 0, 


where ¢; is linear in (¥1, ye, ys), Ws is quadratic, and ¢@- linear in 
(yo¥s, ¥1¥3, ¥i¥2). The point (0, 0, 0, 1) is a tacnode on ¢5, and 
(0, 0, 1, 0), (0, 1, 0, 0), (1, 0, 0, 0) are triple points with non- 
composite tangent cones. 

4. The Surface $\phi_5$ with Three Ordinary Triple Points and a
Tacnode, the Tacnode lying with one of the Triple Points on a
Line of $\phi_5$ Situated in the Tangent Plane at the Tacnode. This
surface is the transform by $T_{3-3}$ of a rational surface $\phi_4$ of order
four, having a double line and two double points in a plane
through the double line. One fundamental point of the transformation
is on the double line, two at the double points, and
the fourth a general point of $\phi_4$. The plane of the double points
and double line is tangent to $\phi_4$ along the line joining the double
points. The transform of $\phi_4$ by $T_{3-3}$ is


= (y2 — + (y2 — + + vs = 0, 


where yo, Ys, and ys are forms in (yi, ye, y3) of the order of their 
subscripts. The triple points are then (1, 0, 0, 0), (0, 1, 0, 0), 
(0, 0, 1, 0), and the tacnode is (0, 0, 0, 1). 


5. The Surface $\phi_5$ with Two Triple Points and Two Tacnodes.
A space Cremona transformation of order three is defined by the
web of surfaces of order three passing through three fixed
conics $k_2$, $k_2'$, $k_2''$ which lie in distinct planes, have one point
in common, and meet by pairs in three points. The conjugate
system is defined by cubic surfaces having in common three
non-concurrent coplanar lines and a space cubic curve meeting each
line once.

The inverse of this transformation carries a quadric surface
through one of the lines and the intersection of the other two
into a surface $\psi_5$ of order five having two of the fundamental
conics as double curves and the third a simple curve. The
transformation $T_{3-3}$ with fundamental points at the intersections
of these conics carries $\psi_5$ into a surface $\phi_5$ which has two
triple points and two tacnodes. The tacnodes are the images
of the double conics of $\psi_5$.


The equation of $\phi_5$ obtained by the above procedure is

os = ye + Ayiystte + Byryous + Cyiyeys 
+ u;(Dysu2 + Dy2us + Eyzys) | 
+ yal +A ViV3U5 + Byiyous + ui(F y3tts + 
+ us(Fysu2 + Dyous + Evoys)] + uausus = 0, 


where 
bsc4 
baye + V3; uy = + bsys) + == Mise, 
2 ce 
a3C4 a3C2 
te = + us = + asy3) + — Yea, 
Ci Ci 
Uz = + ; Ye, = yi(deye + + 
1 1 
and the double conics of the first transformation are 
x, = 0, = 0, (i,j,k = 2,3,4,---,tA 7A hk), 


t2=0, = 0, (i,j,k = 1,3,4,---, 
ws=0, =0, = 1,2,4,---, b). 


The points (0, 1, 0, 0) and (0, 0, 1, 0) are tacnodes and the points 
(1, 0, 0, 0), (0, 0, 0, 1) are triple points. The line y2=0, y;=0 lies 
on the surface. 


6. The Surface $\phi_5$ with One Triple Point and Three Tacnodes.
The inverse of the first transformation of §5 carries a quadric
surface through the intersections of the fundamental straight
lines into a surface $\psi_6$ of order six having the fundamental
conics as double curves, their common intersection a four-fold
point, and the three points of intersection of these conics by
pairs three-fold points. The transformation $T_{3-3}$, with fundamental
points at these multiple points, carries $\psi_6$ into a $\phi_5$ with
one triple point and three tacnodes. The procedure described
above gives


os = yi + + yous + 
+ yyuo(us + ys) + + + 
+ + yous + yoys) + + yitts(us + ys) 
+ yiyous| + + yotts) + = 0, 


where the $u_i$ are as indicated in §5. The point (0, 0, 0, 1) is an
ordinary triple point and the points (1, 0, 0, 0), (0, 1, 0, 0),
(0, 0, 1, 0) are tacnodes. The tangent cones at the tacnodes are,
respectively,


(devo + + days)? = 0, + bays + bays)? = 0, 
(e191 + = 0. 


7. The Surface $\phi_5$ with an Ordinary Triple Point and a Tacnodal
Triple Point. A Cremona transformation is defined by the
web of quadric surfaces containing a fixed conic and an arbi- 
trary point. A special case of this transformation arises when the 
point is on the fixed conic. 

If the conic of this web is tangent to a rational quartic surface 
with a double line at a general point of the line, the transforma- 
tion defined by the web carries the quartic surface into a quintic 
having an ordinary triple point and a triple point with an ad- 
jacent infinitesimal double line,* or tacnodal triple point, both 
lying on the fundamental conic. 

The equation of the quintic obtained by the above transfor- 
mation is 


= ve vil yi, ¥2) + M1, V2, ¥2) + 2) 
+ yhalyi, v2) + ye) = 0. 


The point (0, 0, 1, 0) is an ordinary triple point and (0, 0, 0, 1) 
a tacnodal triple point. In the plane y;=0 are five straight lines 
on the surface, images of the residual intersections of the fundamental
conic and the quartic surface. On $\phi_5$ are eighteen conics
passing through the triple points and lying by pairs in nine
planes through the triple points.


8. The Surface $\phi_5$ with One Tacnodal Triple Point and One
Tacnode. In the following the vertices of the tetrahedron of
reference are the points $A_i$, those of the conjugate system $B_i$,
$(i = 1, 2, 3, 4)$.

A rational surface of order four of the third type of Noether
has a double point $A_4$ which is a cusp in a general plane section
through it. The surface has a simple line passing through the
double point; a general section through this line is a cubic curve
having the line as inflectional tangent at $A_4$. The tangent cone
at $A_4$ is the plane $p$ taken twice; this is tangent to $\phi_4$ along the
line. A section by this plane is a conic tangent to the line at $A_4$.

* Segre, Annali di Matematica, (2), vol. 25 (1896), pp. 1-53.
In the transformation of §7, let the conic be tangent to 4 
at T= Fo=C2A sp, Fi =Cy Under : BiB}, 
where B; is a tacnode, and B, is a tacnodal triple point. The 
conic C? is on @¢;. As in (7), the point B, is a tacnodal triple 
point, the tangent planes being p’ taken twice, and p; contain- 
ing Ci 
The equation of the quintic surface obtained by this method 
is 
+ + $3 — yi(yiye + + y?)] 
+ = yr + ye + y?) + $3) | 
— yeve + + + viCs = 0, 


where ¢;, $3, and C; are forms in (y;, ye) of the order of their 
subscripts. 

The point (0, 0, 0, 1) is a tacnodal triple point and (0, 0, 1, 0) 
is a tacnode lying on a line of ¢; situated in the simple tangent 
plane at the triple point. The section of ¢; by the simple tangent 
plane at the triple point is the conic y3y;— y? =0 and the line ye 
=0 counted three times; this line is the image of the simple 
line of 4. 


9. The Surface $\phi_5$ with an Ordinary Triple Point and an
Oscnode. A quadratic Cremona transformation is defined by the
quadric surfaces $F_2$ having in common two generators and osculating
at their point of intersection.* Let the generators be $l_1$
and $l_2$ and their point of intersection $A_1$. The transformation is


T = [Fetth, le, Ar, =U, 14, Bil. 


Let $A_1$ be a generic point on a monoidal quartic surface $\phi_4$, and
let $l_1$, $l_2$ each osculate $\phi_4$ at $A_1$. Let the triple point of $\phi_4$ be $A_4$.
Under $T$, $\phi_4 \sim \phi_5$ with an ordinary triple point at $B_4$ and an
oscnode at $B_1$.

* Hudson, loc. cit., pp. 197-198.

A general straight line through $A_1$ has three residual intersections
with $\phi_4$; hence the image straight line has three intersections
with $\phi_5$ not at $B_1$. A general straight line meets $\phi_4$ in
four points; hence its image conic has four points in common
with $\phi_5$ not at $B_1$; that is, at $B_1$ are three consecutive double
points on the image conic or $B_1$ is an oscnode on $\phi_5$. The equation
of $\phi_4$ is
os = t+ = 0, 
where 
¥3=Ax} Ws, 
Bx? tM, 
and where w;, u; are binary forms in (x2, x3) of order 7. The line 
x2=0, x;=0 osculates ¢, at (1, 0, 0, 0). The equations of trans- 
formation are 
= Bl yiys — Yoys), pX2 = Ayeys, pXs = pXs = 
The equation of the resulting quintic is 
B*(yiye — yoys)?[ABy: + wi] 
+ — yoys)[weys + us] + yal ways + us] = 0, 
where $w_i$, $u_i$ are the above forms in $(y_2, y_3)$. The point (0, 0, 0, 1)
is a triple point and (1, 0, 0, 0) is an oscnode.


10. The Surface $\phi_5$ with an Oscnode and a Tacnode. Applying
the transformation of §9 to a quartic surface of the first type of
Noether, we find


os = — yoys)?[ABy: 
+ B(yivs — yoys) + us] + ways = 0, 
where $u_i$ are binary forms in the variables $(y_2, y_3)$ of order $i$. The
point (0, 0, 0, 1) is a tacnode and (1, 0, 0, 0) an oscnode.


11. The Surface $\phi_5$ with an Oscnode and a Double Point of the
First Order. Such a surface is the transform by the above transformation
of a quartic of the second type of Noether. The equation
of this quintic is

os = o2 + + + 43] 
+ yal yt + — + Foi + D(2Fy: + Gy2) 
+ + yore + + yi oi } 
+ + AB y? = 0, 




where 
D=b,B, E=hA, F=aB, G=a@A, ¢:=Dy:+Ey, 
and where 
vi = + G— 
= AB*y? + B*Ksy? ye + BKayiy? + Ksy?, 
= — 2AB*y? + B(Ke — 2BK)yiy2 — BKiy?, 
oi = BK7zy, + (Ks — BK,)ys, 
vs = (2DF — BK,) yoy? + B(BKs — Ke) y? ys — 2Fy? — BKzy3. 


The point (1, 0, 0, 0) is an oscnode, and (0, 0, 0, 1) is a double 
point of the first order. 


12. The Surface $\phi_5$ with an Oscnode and a Double Point of the
Second Order. Such a surface is the transform of a quartic surface
of the third type of Noether. The equation of this quintic is


os = — yoys)2b1 + Blyiys — yoys) + 
+ + 2] = 0, 


where @; is linear in all variables, and where ¥, ¢2, 3 are forms 
in (ye, y3) of the order of their subscripts. The point (1, 0, 0, 0) 
is an oscnode, and (0, 0, 0, 1) is a double point of the second 
order. 


13. The Surface $\phi_5$ as a General Member of a Homaloidal
Family of a Cremona Space Transformation. The quintic surfaces
discussed above are all rational and their (1, 1) representations
on a plane $\pi$ are known. Cremona* has shown that the
set of space transformations having $\phi$ as a general member of the
first homaloidal family corresponds to a set of plane Cremona
transformations in $\pi$.

If, by a Cremona transformation, ¢, is the transform of a 
rational y,, which is a general member of the homaloidal fami- 
lies of k Cremona transformations, then ¢, will serve as the 
general member of the homaloidal families of k Cremona trans- 
formations. 


* Cremona, Istituto Lombardo Rendiconti, (2), vol. 4 (1871), pp. 269-279; 
and Annali di Matematica, (2), vol. 5 (1871), pp. 131-162. 




14. Conclusion. Each of the rational quintic surfaces discussed
above can serve as a general member of the homaloidal
family of a Cremona transformation. The surfaces of §§5, 6, 9 are
the transforms, by Cremona transformations, of a general
quadric surface. The surface of §2 is the transform of a general
$\phi_3$; that of §10 is the transform of a quartic surface of the first
type of Noether which is the transform of a general $\phi_3$. The
surfaces of §§8, 12 are the transforms of a quartic surface of
the third type of Noether which is the transform of a special
quartic of the first type of Noether with a double point in the
tangent plane at the tacnode; this plane is tangent to the surface
along the line joining the double points,* and this surface
is a transform of a general $\phi_3$. The surfaces of §§7, 11 are transforms
of a quartic with a double line. The surface of §4 is the
transform of a quartic with a double line and two double points
coplanar with the double line; the surfaces of the homaloidal family
of which this $\phi_5$ is a general member have in common the double line
and also have contact along the line joining the double points. The
surface of §3 is the transform of a quartic with a double conic.‡


CORNELL UNIVERSITY


* Noether, Mathematische Annalen, (3), vol. 33 (1889), pp. 546-571.
† Montesano, Roma Rendiconti, (4), vol. 5—2 (1889), pp. 123-130.
‡ Aroldi, Giornale di Matematiche, (3), vol. 11 (1920), pp. 175-192.


1932.] THE PROBLEM OF MAYER 303 


ADJOINT SYSTEMS IN THE PROBLEM OF MAYER 
UNDER GENERAL END-CONDITIONS* 


BY S. B. MYERS 


1. Introduction. Necessary conditions in the problems of 
Lagrange and Mayer with variable end points, analogous to the 
Euler equations and transversality condition in the ordinary 
problem of the calculus of variations in the plane, have recently 
been given by Morse and the author.j The methods used in the 
derivation of these conditions, as well as those used by Blisst 
in his derivation of somewhat more complicated transversality 
conditions in the same problem, consist primarily of certain 
proofs which more properly belong to the theory of differential 
equations. 

In the present paper, certain adjoint relationships in the 
problem hitherto unnoticed are pointed out which make it pos- 
sible to derive the necessary conditions mentioned above merely 
by a consideration of the compatibility of the adjoint system of a 
certain set of differential equations and boundary conditions. 
In fact, in the anormal case it is shown that these necessary con- 
ditions form an adjoint system of the differential and terminal 
equations of variation. The normal case is treated essentially by 
making it the anormal case of a slightly different problem.
Such considerations have the advantage of referring a major 
part of the proofs to the theory of differential equations, and 
thus greatly simplifying them. 

2. The Definition of the Adjoint System. Consider the set of m
linear homogeneous differential equations of the first order§ in
the variables $\eta_1, \dots, \eta_n$,


(1)  $L_\beta(\eta) \equiv p_{\beta i}\,\eta_i' + q_{\beta i}\,\eta_i = 0$,
     $(\beta = 1, \dots, m;\ i = 1, \dots, n)$,


* Presented to the Society, April 3, 1931. 

† Morse and Myers, Proceedings of the American Academy of Arts and
Sciences, vol. 66 (1931), pp. 235-253.

‡ Bliss, Transactions of this Society, vol. 19 (1918), pp. 305-314.

§ The usual convention of summation with respect to an index repeated 
in the same term is used throughout. 


304 S. B. MYERS [April, 


where the functions $p_{\beta i}(x)$, $q_{\beta i}(x)$ are continuous on the interval
$x^1 \le x \le x^2$ and the matrix $\|p_{\beta i}(x)\|$ is of rank m throughout this
interval.

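By the adjoint set of equations we shall mean the following set
of n equations in the m dependent variables $\lambda_1, \dots, \lambda_m$:


(2)  $M_i(\lambda) \equiv \dfrac{d}{dx}\,(p_{\beta i}\lambda_\beta) - q_{\beta i}\lambda_\beta = 0$,
     $(\beta = 1, \dots, m;\ i = 1, \dots, n)$.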


By a continuous solution of these quasi-differential equations*
is meant a set $(\lambda)$ of m continuous functions of x such that the
functions $p_{\beta i}\lambda_\beta$ are of class C' and the equations are satisfied.

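We note the following identity† which holds for arbitrary
functions $(\eta) = (\eta_1, \dots, \eta_n)$ of class C' and arbitrary continuous
functions $(\lambda) = (\lambda_1, \dots, \lambda_m)$ such that the functions $p_{\beta i}\lambda_\beta$ are of
class C':


(3)  $\int_{x^1}^{x^2} \bigl[\lambda_\beta L_\beta(\eta) + \eta_i M_i(\lambda)\bigr]\,dx = \bigl[p^s_{\beta i}\lambda^s_\beta\eta^s_i\bigr]_1^2$.


Here $\eta^s_i$ represents the value of $\eta_i(x)$ at $x = x^s$, (s = 1, 2), and $\lambda^s_\beta$
the value of $\lambda_\beta(x)$ at $x = x^s$.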
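
The identity can be checked pointwise before integrating. The following is a short sketch in the notation of (1) and (2), assuming only the product rule and that $p_{\beta i}\lambda_\beta$ is of class C' (the `align*` environment assumes the amsmath package):

```latex
% Pointwise form of identity (3): the q-terms cancel and the rest is a total derivative.
\begin{align*}
\lambda_\beta L_\beta(\eta) + \eta_i M_i(\lambda)
  &= \lambda_\beta\bigl(p_{\beta i}\eta_i' + q_{\beta i}\eta_i\bigr)
   + \eta_i\Bigl(\tfrac{d}{dx}(p_{\beta i}\lambda_\beta) - q_{\beta i}\lambda_\beta\Bigr) \\
  &= p_{\beta i}\lambda_\beta\,\eta_i' + \eta_i\,\tfrac{d}{dx}(p_{\beta i}\lambda_\beta)
   = \tfrac{d}{dx}\bigl(p_{\beta i}\lambda_\beta\eta_i\bigr).
\end{align*}
% Integrating from x^1 to x^2 gives the value at s = 2 minus the value at s = 1.
```

Integration over the interval then yields the boundary bracket on the right of (3).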

Consider any set of homogeneous linear boundary conditions
on $\eta^s_i$. By a set of adjoint boundary conditions of these conditions
shall be meant a set of linear homogeneous boundary conditions
on $\lambda^s_\beta$ with the following properties.

I. The bilinear form $[p^s_{\beta i}\lambda^s_\beta\eta^s_i]_1^2$ vanishes for all $\lambda^s_\beta$ which are
subject to these conditions while the $\eta^s_i$ are subject to the original
boundary conditions.

II. Of all sets of conditions with the above property (I), these
conditions restrict $\lambda^s_\beta$ as little as possible; that is, the points in
2m-space with coordinates $\lambda^s_\beta$ fill out a subspace of maximum
possible dimensionality.

The original boundary conditions may be written in the form


(4)  $\eta^s_i = c^s_{ih}u_h$,  $(h = 1, \dots, r;\ s = 1, 2;\ i = 1, \dots, n)$,


where $(u) = (u_1, \dots, u_r)$ are parameters which are allowed to


* See Bôcher, Leçons sur les Méthodes de Sturm, p. 10.
† The notation $[\ \ ]_1^2$ means the value of [ ] for s=2 minus its
value for s=1.




take on any values whatever, and $c^s_{ih}$ is a $(2n \times r)$-dimensional
matrix of constants.


THEOREM 1. A necessary and sufficient condition that a set of
homogeneous linear boundary conditions on $\lambda^s_\beta$ in the form


(5)  $b^s_{\delta\beta}\lambda^s_\beta = 0$,  $(\delta = 1, \dots, p;\ p$ arbitrary$)$,


be a set of adjoint boundary conditions of (4) is that they be
equivalent to the following conditions:


(6)  $\bigl[p^s_{\beta i}\lambda^s_\beta c^s_{ih}\bigr]_1^2 = 0$,  $(h = 1, \dots, r)$.


The proof is simple, and will be left to the reader. 
Equations (2) and conditions (6) together form an adjoint 
system of (1) and (4). 
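
The omitted proof is not reproduced here, but the reason conditions (6) are the natural candidate can be sketched in one line. Assuming the boundary conditions (4) are substituted into the bilinear form of property I, and noting that the parameters $u_h$ do not depend on s:

```latex
% Bilinear form of property I, evaluated on boundary values satisfying (4).
\[
\bigl[p^s_{\beta i}\lambda^s_\beta\eta^s_i\bigr]_1^2
  = \bigl[p^s_{\beta i}\lambda^s_\beta c^s_{ih}\bigr]_1^2\,u_h .
\]
% The u_h are arbitrary, so the form vanishes for every admissible eta
% exactly when each coefficient of u_h vanishes, which is condition (6).
```

Read this way, (6) is equivalent to property I itself, so no set of conditions with property I can restrict the $\lambda^s_\beta$ less than (6) does.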


3. Compatibility of the Adjoint System. In applying our defi- 
nition of an adjoint system to the problem of Mayer, the chief 
interest will lie in the question as to when an adjoint system has 
a solution. To answer this question, we start by adjoining to the 
differential equations (1) n - m more differential equations so
that the extended set of equations becomes


(7)  $p_{ki}\,\eta_i' + q_{ki}\,\eta_i = \zeta_k$,
     $(\zeta_\beta = 0)$,  $(\beta = 1, \dots, m;\ i, k = 1, \dots, n)$.

Here the functions $q_{ri}(x)$, $p_{ri}(x)$, $(r = m+1, \dots, n)$, are any
continuous functions subject only to the restriction that the
determinant $|p_{ki}(x)|$ be different from zero along $(x^1, x^2)$.*

If we have any solution $(\eta)$ of (1) of class C', we can use (7)
to define continuous functions $\zeta_r(x)$ which we will say correspond
to $(\eta)$. On the other hand, we can choose the functions
$\zeta_r$ as arbitrary continuous functions of x and obtain a solution
$(\eta)$ of (7) of class C' taking on arbitrary values at one end point.
This set $(\eta)$ will automatically be a solution of (1).

We now write down an extended adjoint set of equations 


(8)  $\dfrac{d}{dx}\,(p_{ki}\lambda_k) - q_{ki}\lambda_k = 0$,  $(i, k = 1, \dots, n)$.


We will use later the fact that these equations have a continuous 


* For a proof of the possibility of this adjunction, see Bliss, loc. cit., p. 312. 




solution $(\lambda_1, \dots, \lambda_n)$ such that the functions $p_{ki}\lambda_k$ take on
arbitrary values at one end point (since $|p_{ki}|$ is not zero).


LEMMA. Between functions $\eta_i(x)$ of class C' satisfying (7) with
the corresponding $\zeta$'s and continuous solutions $\lambda_k$ of (8) we have
the following relation:


(9)  $\bigl[p^s_{ki}\lambda^s_k\eta^s_i\bigr]_1^2 = \int_{x^1}^{x^2} \lambda_r\zeta_r\,dx$.


For it is easily verified that the relation


$\dfrac{d}{dx}\,(p_{ki}\lambda_k\eta_i) - \lambda_r\zeta_r = 0$,  $(i, k = 1, 2, \dots, n;\ r = m+1, \dots, n)$,


holds, and we need only integrate this to obtain (9).
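
The verification left to the reader is brief. A sketch, assuming only the extended equations (7) and (8), the convention $\zeta_\beta = 0$ for $\beta = 1, \dots, m$, and the amsmath `align*` environment:

```latex
% Differentiate the product, then substitute (8) for the first term and (7) for the second.
\begin{align*}
\tfrac{d}{dx}\bigl(p_{ki}\lambda_k\eta_i\bigr)
  &= \eta_i\,\tfrac{d}{dx}(p_{ki}\lambda_k) + \lambda_k\,p_{ki}\eta_i' \\
  &= \eta_i\,q_{ki}\lambda_k + \lambda_k\bigl(\zeta_k - q_{ki}\eta_i\bigr)
   = \lambda_k\zeta_k = \lambda_r\zeta_r ,
\end{align*}
% The last step uses zeta_k = 0 for k = 1, ..., m, so only r = m+1, ..., n contribute.
```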


THEOREM 2. A necessary and sufficient condition that the adjoint
system (2), (6) have a continuous solution $(\lambda_1, \dots, \lambda_m)$ not
zero at any point of $(x^1, x^2)$ is that all determinants of the form


(A)  $\bigl|\eta^s_{ij} - \bar\eta^s_{ij}\bigr|$,  (2n columns: $j = 1, \dots, 2n$),
                                                  (2n rows: $i = 1, \dots, n;\ s = 1, 2$),

vanish, where $\eta^s_{i1}, \dots, \eta^s_{i,2n}$ are the end values of 2n sets of solutions
of class C' of the original differential equations (1) and
$\bar\eta^s_{i1}, \dots, \bar\eta^s_{i,2n}$ are 2n sets of values which satisfy the boundary
conditions (4) with 2n sets of values of the parameters (u).


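To prove the necessity of the condition, assume that there is
a continuous solution $(\lambda_1, \dots, \lambda_m)$ of (2) and (6) not zero at
any point of $(x^1, x^2)$. Then the set $(\lambda_1, \dots, \lambda_m, 0, \dots, 0)$ is a
solution of (8) and the preceding lemma proves that


(10)  $\bigl[p^s_{\beta i}\lambda^s_\beta\eta^s_i\bigr]_1^2 = 0$.

If we multiply equations (6) respectively by arbitrary constants
$u_1, \dots, u_r$ and add, and subtract from (10), we obtain

(11)  $\bigl[p^s_{\beta i}\lambda^s_\beta(\eta^s_i - c^s_{ih}u_h)\bigr]_1^2 = 0$,


which may be written as


(12)  $\bigl[p^s_{\beta i}\lambda^s_\beta(\eta^s_i - \bar\eta^s_i)\bigr]_1^2 = 0$.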




The 2n coefficients $p^1_{\beta i}\lambda^1_\beta$, $p^2_{\beta i}\lambda^2_\beta$ are not all zero, for otherwise
we would have $\lambda^s_\beta = 0$, which is contrary to hypothesis.
Hence all determinants of form (A) must vanish, and the condition
of our theorem is necessary.
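
The step from (12) to the vanishing of the determinants (A) can be spelled out as a linear-algebra remark. In the sketch below the symbols $w^s_i$ and $D$ are introduced only for illustration and do not appear in the original:

```latex
% Collect the 2n boundary coefficients of (12) into one vector w (hypothetical notation).
\[
w^1_i = -\,p^1_{\beta i}\lambda^1_\beta, \qquad
w^2_i = p^2_{\beta i}\lambda^2_\beta, \qquad
D = \bigl\|\eta^s_{ij} - \bar\eta^s_{ij}\bigr\| \quad (2n \times 2n).
\]
% Relation (12), written once for each of the 2n columns j, says that w annihilates D.
\[
w^s_i\bigl(\eta^s_{ij} - \bar\eta^s_{ij}\bigr) = 0 \quad (j = 1, \dots, 2n),
\qquad w \neq 0 \;\Longrightarrow\; \det D = 0 .
\]
```

A square matrix with a nonzero vector orthogonal to all of its columns is singular, which is all that the necessity argument uses.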

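Now suppose that all determinants of form (A) vanish. Then
there must exist constants $m^s_i$ not all zero such that the equation


(13)  $m^s_i(\eta^s_i - \bar\eta^s_i) = 0$


holds for all solutions $\eta_i(x)$ of class C' of (1), and all solutions
$\bar\eta^s_i$ of (4). This is proved as in Morse and Myers (loc. cit., p. 243).
Using (4), we may write this in the form


(14)  $m^s_i(\eta^s_i - c^s_{ih}u_h) = 0$.

Let $(\lambda_1, \dots, \lambda_n)$ be any continuous solution of (8) not
identically zero. Then $(\lambda_1, \dots, \lambda_n)$ cannot all be zero at any
one point of $(x^1, x^2)$ because of the homogeneity of (8). Subtract
(9) from (14). We obtain


(15)  $(m^2_i - p^2_{ki}\lambda^2_k)\,\eta^2_i + (m^1_i + p^1_{ki}\lambda^1_k)\,\eta^1_i + \int_{x^1}^{x^2} \lambda_r\zeta_r\,dx - m^s_i c^s_{ih}u_h = 0$,
      $(i, k = 1, 2, \dots, n;\ h = 1, 2, \dots, r;\ r = m+1, \dots, n)$.


According to the sentence following (8), we may consider
$p^1_{ki}\lambda^1_k$ as having arbitrary values. Furthermore, according to the
remarks immediately preceding (8), we may consider $\zeta_r$ as
arbitrary continuous functions of x, and $\eta^2_i$ as having arbitrary
values. The values of the parameters $u_h$ are also arbitrary.
Let us choose $p^1_{ki}\lambda^1_k = -m^1_i$. Then if we choose temporarily
$\eta^2_i = 0$ and $u_h = 0$, we find that $\lambda_r = 0$. Equation (15) may now be
written in the form


(16)  $\eta^2_i\,(m^2_i - p^2_{\beta i}\lambda^2_\beta) - m^s_i c^s_{ih}u_h = 0$.

From the arbitrariness of $u_h$ and $\eta^2_i$ we deduce that

$p^2_{\beta i}\lambda^2_\beta = m^2_i$,  $m^s_i c^s_{ih} = 0$.

Thus the conditions


$\bigl[p^s_{\beta i}\lambda^s_\beta c^s_{ih}\bigr]_1^2 = 0$,  $\dfrac{d}{dx}\,(p_{\beta i}\lambda_\beta) - q_{\beta i}\lambda_\beta = 0$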
x 




are satisfied with $\lambda_r = 0$, $(r = m+1, \dots, n)$, or, in other words,
conditions (6) and equations (2) have a continuous solution not
zero. Hence the condition of our theorem is sufficient.
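
The last step, that the end values found above satisfy (6), is a direct substitution. A sketch, assuming the end values $p^1_{\beta i}\lambda^1_\beta = -m^1_i$ and $p^2_{\beta i}\lambda^2_\beta = m^2_i$ together with the relation $m^s_i c^s_{ih} = 0$ (summed over s and i):

```latex
% Expand the bracket as (value at s = 2) minus (value at s = 1) and substitute.
\begin{align*}
\bigl[p^s_{\beta i}\lambda^s_\beta c^s_{ih}\bigr]_1^2
  &= p^2_{\beta i}\lambda^2_\beta\,c^2_{ih} - p^1_{\beta i}\lambda^1_\beta\,c^1_{ih} \\
  &= m^2_i c^2_{ih} + m^1_i c^1_{ih}
   = m^s_i c^s_{ih} = 0 , \qquad (h = 1, \dots, r).
\end{align*}
```

The solution is not zero because the constants $m^s_i$ are not all zero and the matrix $\|p_{\beta i}\|$ has rank m.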

4. The Problem of Mayer. Let us consider arcs $y_i = y_i(x)$ of
class C' in (n+1)-space, and let us denote the end points of these
arcs by $(x^s, y^s_1, \dots, y^s_n) = (x^s, y^s)$, where s=1 at the initial end
point and s=2 at the final end point.

Such arcs which satisfy the differential equations*


(17)  $\phi_\beta(x, y, y') = 0$


are called differentially admissible arcs. The functions $\phi_\beta$ are of
class C''. Arcs of the same type which satisfy, for some value of
the parameters $(\alpha) = (\alpha_1, \dots, \alpha_r)$, the conditions


(18)  $x^s = x^s(\alpha)$,  $y^s_i = y^s_i(\alpha)$,


will be called terminally admissible arcs. The end point functions
$x^s(\alpha)$, $y^s_i(\alpha)$ are to be of class C'. An arc which is both differentially
and terminally admissible will be called admissible. A set
$(\alpha)$ will be called admissible if it determines through (18) the
end points of some admissible arc.

Let E be an admissible arc $y_i = \bar y_i(x)$, $a^1 \le x \le a^2$, which
satisfies (18) for $(\alpha) = (0)$ and along which the functional matrix
$\|\phi_{\beta y_i'}\|$ is of rank m. We seek conditions under which E and the
set $(\alpha) = (0)$ afford a minimum for $\alpha_1$ among admissible sets $(\alpha)$
and the corresponding admissible arcs.

This simple form of the problem is new. It produces sym- 
metrical results and proofs, and makes clear certain adjoint 
relationships in the problem. It is as general as any other form 
previously used in the sense that any of the other forms can be 
put into this form by simple transformations. The general form 
used by Morse and Myers (loc. cit.), in which the expression to 
be minimized is


$\int_{x^1}^{x^2} f(x, y, y')\,dx + \theta(\alpha)$,


can be put into the form used in this paper by introducing a new
dependent variable $y_0$ and a parameter $\alpha_0$, and adjoining the
new differential equation $y_0' - f(x, y, y') = 0$, and the new end


* We use the notation $(x, y, y')$ for $(x, y_1, \dots, y_n, y_1', \dots, y_n')$.




conditions $y^1_0 = 0$, $y^2_0 = \alpha_0 - \theta(\alpha)$. The expression to be minimized
becomes simply $\alpha_0$.
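
That the new parameter $\alpha_0$ carries the value of the original expression can be checked in two lines. The sketch assumes the Morse and Myers functional has the form $\int f\,dx + \theta(\alpha)$, which is what the end condition $y^2_0 = \alpha_0 - \theta(\alpha)$ suggests:

```latex
% Integrate the adjoined equation y_0' = f along the arc, then apply the new end conditions.
\begin{align*}
y^2_0 - y^1_0 &= \int_{x^1}^{x^2} y_0'\,dx = \int_{x^1}^{x^2} f(x, y, y')\,dx , \\
\alpha_0 - \theta(\alpha) - 0 &= \int_{x^1}^{x^2} f(x, y, y')\,dx
\quad\Longrightarrow\quad
\alpha_0 = \int_{x^1}^{x^2} f(x, y, y')\,dx + \theta(\alpha) .
\end{align*}
```

Minimizing $\alpha_0$ among admissible sets is therefore the same as minimizing the original expression.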


5. The Euler-Lagrange Equations and the Transversality Condition.
A set of functions $\eta_i(x)$ of class C' which satisfy the
differential equations of variation


(19)  $\phi_{\beta y_i'}\,\eta_i' + \phi_{\beta y_i}\,\eta_i = 0$,  $(x, y, y')$ on E,


will be called a set of differentially admissible variations.
We shall attach the subscript h to $x^s(\alpha)$ or $y^s_i(\alpha)$ to denote
differentiation with respect to $\alpha_h$. Let us set


$c^s_{ih} = y^s_{ih} - \bar y_i'(a^s)\,x^s_h$,  $(h = 1, \dots, r)$.


Then a set of 2n numbers $\bar\eta^s_i$ which satisfy the terminal equations
of variation


(20)  $\bar\eta^s_i = c^s_{ih}u_h$


for some set of numbers $(u) = (u_1, \dots, u_r)$ will be called a set of
terminally admissible variations and will be said to be determined
by the set (u).

Let $\|\eta_{ij}(x)\|$, $(i = 1, \dots, n;\ j = 1, \dots, 2n+2)$, be any matrix
consisting of 2n+2 columns of differentially admissible variations.
Let $\gamma^1_j$, $\gamma^2_j$ be the jth of 2n+2 sets of 2 arbitrary constants.
Then the following theorem can be proved.*


THEOREM 3. There exists a (2n+2)-parameter family of differentially
admissible arcs


(21)  $y_i = y_i(x, e_1, \dots, e_{2n+2})$,  $(i = 1, \dots, n)$,
      $x^1(e_1, \dots, e_{2n+2}) \le x \le x^2(e_1, \dots, e_{2n+2})$,


containing the arc E for $(e_1, \dots, e_{2n+2}) = (0, \dots, 0)$, and such
that


$x^s_{e_j}(0, \dots, 0) = \gamma^s_j$,  $y_{ie_j}(x, 0, \dots, 0) = \eta_{ij}(x)$,
$(j = 1, \dots, 2n+2)$.


The functions $y_i$, $y_{ix}$, and $x^s$ are of class C' in $x, e_1, \dots, e_{2n+2}$.


* See Morse and Myers, loc. cit., p. 239. 




Now let $u_{pj}$, $(p = 2, \dots, r;\ j = 1, \dots, 2n+2)$, be sets
of r - 1 arbitrary constants. Consider the equations


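(22)  $x^s(e) - x^s(\alpha) = 0$,  $y_i[x^s(e), e] - y^s_i(\alpha) = 0$,  with $\alpha_p = u_{pj}e_j$, $(p = 2, \dots, r)$.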


These equations are conditions for terminal admissibility and
they have the initial solution $(\alpha_1, e_1, \dots, e_{2n+2}) = (0, 0, \dots, 0)$.
If the jacobian of the left hand sides of equations (22) with respect
to $e_1, \dots, e_{2n+2}$ were different from zero at this solution,
the implicit function theorems would show that (22) would
have solutions for $(e_1, \dots, e_{2n+2})$ as functions of $\alpha_1$ for all values
of $\alpha_1$ in the neighborhood of $\alpha_1 = 0$. That is, there would be admissible
arcs in the family (21) giving $\alpha_1$ a value smaller than
zero. This would contradict the fact that E is a minimizing arc.
Hence we conclude that the jacobian must be zero.

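If we attach the subscript p to $x^s$ or $y^s_i$ to represent differentiation
with respect to $\alpha_p$, the jacobian in question is


(23)  $\begin{vmatrix} \gamma^s_j - x^s_p u_{pj} \\ \eta^s_{ij} + \bar y_i'(a^s)\,\gamma^s_j - y^s_{ip}u_{pj} \end{vmatrix}$,  $(p = 2, \dots, r)$,
      (2n+2 columns; $j = 1, \dots, 2n+2$),
      (2n+2 rows; $s = 1, 2;\ i = 1, \dots, n$).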

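Let us set $\bar\gamma^s_j = \gamma^s_j - x^s_p u_{pj}$, $(p = 2, \dots, r)$. If we multiply
the first two rows of the jacobian respectively by $\bar y_i'(a^1)$
and $\bar y_i'(a^2)$ and subtract them respectively from the next n rows
and the last n rows, we obtain


(24)  $\begin{vmatrix} \bar\gamma^s_j \\ \eta^s_{ij} - c^s_{ip}u_{pj} \end{vmatrix} = 0$.


This holds for arbitrary constants $\bar\gamma^s_j$, for any sets of differentially
admissible variations $\eta_{ij}(x)$, and for arbitrary constants
$u_{pj}$.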
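
The row operation leading to (24) can be written out for a single row. The sketch assumes the jacobian rows have the form $\gamma^s_j - x^s_p u_{pj}$ (the two x-rows) and $\eta^s_{ij} + \bar y_i'(a^s)\gamma^s_j - y^s_{ip}u_{pj}$ (the 2n y-rows), and uses the definition $c^s_{ip} = y^s_{ip} - \bar y_i'(a^s)x^s_p$:

```latex
% y-row minus \bar y_i'(a^s) times the corresponding x-row; the gamma terms cancel.
\begin{align*}
&\bigl(\eta^s_{ij} + \bar y_i'(a^s)\,\gamma^s_j - y^s_{ip}u_{pj}\bigr)
  - \bar y_i'(a^s)\bigl(\gamma^s_j - x^s_p u_{pj}\bigr) \\
&\qquad = \eta^s_{ij} - \bigl(y^s_{ip} - \bar y_i'(a^s)\,x^s_p\bigr)u_{pj}
  = \eta^s_{ij} - c^s_{ip}u_{pj} .
\end{align*}
```

Subtracting a multiple of one row from another leaves the determinant unchanged, so the vanishing of the jacobian carries over to (24).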

The arbitrariness of $\bar\gamma^s_j$ enables us to deduce from (24) the
following necessary condition:


(25)  $\bigl|\eta^s_{i\delta} - \bar\eta^s_{i\delta}\bigr| = 0$,  $(\delta = 1, \dots, 2n)$,


where $\eta_{i\delta}(x)$ is the $\delta$th of any 2n sets of differentially admissible

variations, and where $u_{p\delta}$ is arbitrary, $\bar\eta^s_{i\delta}$ being given by the
following:

(26)  $\bar\eta^s_{i\delta} = c^s_{ip}u_{p\delta}$,  $(p = 2, \dots, r)$.


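But, according to Theorem 2 of §3, equation (25) is a sufficient
condition that the following adjoint system of (19) and (26)


(27)  $\dfrac{d}{dx}\,(\phi_{\beta y_i'}\lambda_\beta) - \phi_{\beta y_i}\lambda_\beta = 0$,


have a continuous solution $(\lambda_1, \dots, \lambda_m)$ not zero at any point
of $(a^1, a^2)$:

(28)  $\bigl[\phi_{\beta y_i'}\lambda_\beta\,c^s_{ip}\bigr]_1^2 = 0$,  $(p = 2, \dots, r)$.


But from (28) follows an identity in the differentials $d\alpha_p$:


$\bigl[\phi_{\beta y_i'}\lambda_\beta\,c^s_{ip}\bigr]_1^2\,d\alpha_p = 0$,  $(p = 2, \dots, r)$.

Now $c^s_{ih}\,d\alpha_h = dy^s_i - \bar y_i'(a^s)\,dx^s$, $(h = 1, 2, \dots, r)$, and so we obtain

(29)  $\bigl[\phi_{\beta y_i'}\lambda_\beta\,(dy^s_i - \bar y_i'(a^s)\,dx^s)\bigr]_1^2 + K\,d\alpha_1 = 0$,  (K = constant).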
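
The constant K in (29) can be identified explicitly. A sketch, assuming the relation $c^s_{ih}\,d\alpha_h = dy^s_i - \bar y_i'(a^s)\,dx^s$ and that the terms with $p = 2, \dots, r$ drop out by (28):

```latex
% Expand the bracket in the d(alpha_h); only the h = 1 term survives.
\begin{align*}
\bigl[\phi_{\beta y_i'}\lambda_\beta\,(dy^s_i - \bar y_i'(a^s)\,dx^s)\bigr]_1^2
  &= \bigl[\phi_{\beta y_i'}\lambda_\beta\,c^s_{ih}\bigr]_1^2\,d\alpha_h \\
  &= \bigl[\phi_{\beta y_i'}\lambda_\beta\,c^s_{i1}\bigr]_1^2\,d\alpha_1 ,
  \qquad\text{so}\qquad
  K = -\bigl[\phi_{\beta y_i'}\lambda_\beta\,c^s_{i1}\bigr]_1^2 .
\end{align*}
```

In particular K = 0 exactly when the boundary condition of (28) also holds for the remaining index p = 1.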


These results may be summarized in the following theorem. 


THEOREM 4. If E and the set $(\alpha) = (0)$ afford a minimum in the
problem, there exist a constant K and a set of continuous functions
$\lambda_1(x), \dots, \lambda_m(x)$, not all zero at any point of $(a^1, a^2)$, such that if
we set $F = \lambda_\beta\phi_\beta$, the equations

(30)  $\dfrac{d}{dx}\,F_{y_i'} - F_{y_i} = 0$


are satisfied at every point of E, and the following identity in $(d\alpha)$
holds:


(31)  $\bigl[F_{y_i'}\,(dy^s_i - \bar y_i'(a^s)\,dx^s)\bigr]_1^2 + K\,d\alpha_1 = 0$,


when $dx^s$ and $dy^s_i$ are expressed in terms of $(d\alpha)$ and evaluated for
$(\alpha) = (0)$.


6. Normalcy. An admissible arc E is said to be normal relative
to conditions (30) and (31) if there exists no set of continuous
functions $\lambda_\beta \not\equiv 0$ with which it satisfies these conditions with
K = 0; that is, if it does not satisfy the differential equations
and boundary conditions




(32)  $\dfrac{d}{dx}\,F_{y_i'} - F_{y_i} = 0$,


(33)  $\bigl[F_{y_i'}\,c^s_{ih}\bigr]_1^2 = 0$,  $(h = 1, \dots, r)$.

Otherwise E is said to be anormal relative to these conditions. 
From Theorems 1 and 2 the following theorems are evident. 


THEOREM 5. The system (32) and (33) is an adjoint system of 
the differential and terminal equations of variations (19) and 
(20). 

THEOREM 6. A necessary and sufficient condition for normalcy 
relative to (30) and (31) is that there exist a non-vanishing 
determinant of the form 


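$\bigl|\eta^s_{i\delta} - \bar\eta^s_{i\delta}\bigr|$,  $(\delta = 1, \dots, 2n;\ i = 1, \dots, n;\ s = 1, 2)$,


where $\eta_{i\delta}(x)$, $(i = 1, \dots, n)$, is the $\delta$th of any 2n sets of differentially
admissible variations, and $\bar\eta^s_{i\delta}$ is the $\delta$th of any 2n sets of
terminally admissible variations.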


HARVARD UNIVERSITY 




OFFICIAL COMMUNICATIONS 


Meetings of the Society have been fixed at the following 
times and places: 


On account of the fact that the Summer Meeting of the 
Society in 1932 will be held on the Pacific Coast, the meetings 
that were to have been held at Stanford University (April, 
1932), at Seattle (June, 1932), and at Los Angeles (November,
1932) will be omitted. 


LOS ANGELES, CALIFORNIA, SUMMER MEETING AND COLLOQUIUM,
August 30-September 2, 1932.


Abstracts must be in the hands of Associate Secretary M. H. In- 
graham, University of Wisconsin, Madison, Wis., not later than July 26. 
Abstracts received by July 7 will appear in the July issue, or an earlier 
issue, of this BULLETIN. Professor J. F. Ritt will deliver a series of
colloquium lectures on Differential equations from the algebraic standpoint.
By invitation of the program committee, Professor D. N. Lehmer will 
speak on The continued fraction representing cubic and higher irration- 
alities, and Professor Tibor Rado will speak on Recent work in the prob- 
lem of Plateau. 


New York City, October 29, 1932. 


Abstracts must be in the hands of Associate Secretary Tomlinson Fort, 
501 West 116th St., New York City, not later than October 8. In order 
to be printed in the BULLETIN in advance of the meeting, abstracts must
be in the hands of the Associate Secretary not later than September 7. 
By invitation of the program committee, Professor Jesse Douglas will 
speak on The problem of Plateau. 


ATLANTIC CITY, NEW JERSEY, ANNUAL MEETING, December
27-30, 1932.


Abstracts are due by November 30, 1932. 
R. G. D. RICHARDSON, Secretary of the Society.


Articles for insertion in the BULLETIN should be addressed to E. R.
HEDRICK, Editor of the BULLETIN, University of California at Los
Angeles. Reviews should be sent to W. R. LONGLEY, Yale University, New
Haven, Conn. Notes should be sent to H. W. KUHN, Ohio State
University, Columbus, Ohio.

Subscriptions to the BULLETIN, orders for back numbers, and
inquiries in regard to non-delivery of current numbers should be addressed
to the American Mathematical Society, 450-459 Ahnaip St., Menasha,
Wis., or 501 West 116th St., New York.

The initiation fees and the annual dues of members of the Society 
(see this BULLETIN, p. 322, May, 1930; and the List of Officers and
Members, October, 1930, p. 58), are payable to the Treasurer of the 
Society, Professor G. W. Mullins, 501 West 116th St., New York City. 




WHOLE NUMBER 391 


CONTENTS 


The Calculus of Variations and the Quantum Theory. By 


Statistical Mechanics and the Second Law of Thermo- 
dynamics. By P. W. BRIDGMAN 

Note on Diophantine Automorphisms. By LEONARD 
CARLITZ 

On a Matrix Differential Operator. By A. K. MITCHELL 

Solutions of Bounded Variation of the Fredholm-Stieltjes
Integral Equation. By L. J. PARADISO

On Real Symmetric Determinants whose Principal Diagonal
Elements are Zero. By W. V. PARKER

On the Degree of Generality of a Class of Arithmetical
Identities. By E. T. BELL

On the Moment Problem for a Finite Interval. By T. H.
HILDEBRANDT

New Diophantine Automorphisms. By LEONARD CARLITZ

Note on the Diophantine Equation $ax^2 + by^2 + cz^2 + dt^2 = 0$.
By J. L. MORDELL

Note on an Application of Metric Geometry to Determinants.
By L. M. BLUMENTHAL

On Polynomial Interpolation to Analytic Functions with
Singularities. By J. L. WALSH

Analytic Study of Rational Quintic Surfaces having no
Multiple Curves. By H. N. HUBBS

Adjoint Systems in the Problem of Mayer under General
End-Conditions. By S. B. MYERS




For official Communications and Notices, see the inside of the back cover. 


GEORGE BANTA PUBLISHING COMPANY, MENASHA, WISCONSIN




